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While  theories  of  reading  are  presumably  based  on  an  understanding  of 
orthographic  systems,  the  understanding  of  what  in  the  written  system  represents 
what  in  the  spoken  system  is  often  not  carefully  considered.    Studies  of  writing 
systems  and  the  typological  classifications  of  the  world's  script  types,  however, 
have  provided  insight  into  the  various  types  and  nature  of  orthographic 
representations,  and  it  appears  that  among  seemingly  diverse  scripts,  there  are  some 
striking  commonalities.  Linguistically  informed  analyses  of  various  scripts  tell  us 
that  orthographies  represent  different  "levels"  of  linguistic  knowledge.  What  is 
often  overlooked  in  reading  research  is  that  1)  orthographies  are  only  partially 
representative  of  any  particular  level  of  linguistic  knowledge.  For  example, 
alphabetic  scripts  only  partially  represent  phonology.  And  2),  orthographies 
usually  represent  aspects  of  more  than  one  level  simultaneously:  phonological 
(segmental  and/or  syllabic),  morphophonemic,  morphological,  lexical.  Hence, 
orthographic  systems  are  complex,  incomplete,  sometimes  conflicting 
representations  of  language. 

The  first  component  of  this  dissertation  will  bring  reading  acquisition 
theories  and  empirical  research  relating  to  reading  and  visual  language  processing 
together  with  the  linguistically  informed  theoretical  study  of  writing  systems.  My 
intention  is  to  demonstrate  that  orthographic  systems  (their  commonalties  and 


differences)  are  themselves  evidence  as  to  how  visual  language  is  processed,  and 
that  this  information  can  and  should  inform  reading  theory.    The  second 
component  is  a  discussion  of  what  I  see  as  a  major  implication  of  the  previous 
argument.  That  is,  the  complexity  of  orthographic  systems  indicates  that  learning 
the  associations  necessary  for  reading  involves  far  more  than  what  is  generally 
provided  by  way  of  instruction.  Hence,  reading  acquisition  involves  what,  in  many 
of  the  fields  of  cognitive  science,  is  called  implicit  learning  -a  process  whereby 
structured  information  is  induced  from  a  complex  stimulus  environment,  largely 
without  awareness  of  what  exactly  is  learned  or  how.  It  1S  generally  acknowledged 
that  learning  to  read  and  write  does  not  proceed  automatically  (as  does  first 
language  acquisition)  but  instead  requires  explicit  instruction.   The  notion  that 
reading  acquisition  alsoinvolves  a  large  degree  of  implicit  induction  (a  natural, 
efficient  cognitive  process)  has  major  theoretical  and  pedagogical  implications. 


VI 


CHAPTER  1 
TOWARD  A  THEORY  OF  READING  ACQUISITION:  AN  OVERVIEW 


Virtually  all  reading  research '  is  based  on  the  assumption  that  orthographies 

-  how  the  visual  symbols  are  systematically  arranged  --  are  representative  of  some 

aspect  of  the  structure  of  language.  The  assumption  that  the  representational 

relationship  between  speech  and  writing  is  straightforward  is  a  central  premise  in 

reading  research.  For  example,  alphabets  are  thought  to  be  representative  of  the 

sound  system  of  language.  With  English,  it  is  commonly  assumed  that  alphabets 

represent  phonological  segments  and  that  those  segments  combine  to  form  words  in 

a  fashion  similar  to  the  spoken  language.  Since  reading  research  most  often 

involves  alphabetic  scripts,  the  bulk  of  experimental  research  has  focused  on  how 

phonological  structure  is  used  in  reading.   Although  research  with  non-alphabetic 

orthographies  is  comparatively  limited,  the  premise  of  a  transparent  representational 

relationship  is  the  same.  For  instance,  Chinese  characters  are  generally  assumed  to 

represent  morphemes/words.  Syllabaries  used  for  Japanese  and  Korean  represent 

phonology  at  the  level  of  syllables.  The  notion  that  the  representational  relationship 

may  be  more  complex  than  this  is  less  often  addressed,  especially  in  the 

experimental  literature  and  in  reading  pedagogy. 

While  theories  of  reading  are  presumably  based  on  an  understanding  of  the 

orthographic  representation,  the  conception  of  what  in  the  written  system  represents 

what  in  the  spoken  system  is  often  not  carefully  considered.    Studies  of  writing 

systems  and  the  typological  classifications  of  the  world's  script  types,  however, 

1    I  refer  to  'reading  research'  throughout,  as  that  which  cuts  across  several  disciplines- 
psychology,  education,  linguistics,  and  oUiers,  all  of  which  begin  from  the  assumption  that 
writing  is  a  representation  of  speech. 


have  provided  insight  into  the  various  types  and  nature  of  orthographic 
representations.  Linguistically  informed  analyses  of  various  scripts  tell  us  that 
orthographies  predominantly  represent  different  "levels"  of  linguistic  knowledge 
(see  Daniels,  P.  and  Bright,  W,  1996  for  a  thorough,  linguistically  focused  survey 
of  the  world's  writing  systems).  For  example,  alphabetic  scripts  are  usually, 
predominantly  phonological.  What  is  not  so  clear,  indeed  often  overlooked  in 
reading  research,  is  that  1)  orthographies  are  only  partially  representative  of  any 
particular  level  of  linguistic  knowledge.  For  example,  alphabetic  scripts  only 
partially  represent  phonology  (Chomsky  and  Halle  1968;  Chomsky  1970;  Coulmas 
1989;  Miller  1994;  Daniels  1992;  1996).  And  2),  orthographies  usually  represent 
aspects  of  more  than  one  level  simultaneously:  phonological  (segmental  and/or 
syllabic),  morphophonemic,  morphological,  lexical,  phrasal  (Gleitman  and  Rozin 
1977;  Henderson  1982;  Sampson  1985;  Coulmas  1989;  Daniels  1992,  1996; 
Miller  1994). 

The  first  component  of  this  study  will  bring  together  reading  acquisition 
theories  and  empirical  research  relating  to  reading  and  visual  language  processing, 
with  the  linguistically  informed  theoretical  study  of  writing  systems.  My  intention 
is  to  demonstrate  that  orthographic  systems  (their  commonalties  and  differences)  are 
themselves  evidence  of  how  visual  language  is  processed,  and  that  this  information 
can  and  should  inform  reading  theory.  If  reading  does  involve  simultaneous 
interpreting  of  various  aspects  of  structure,  then  understanding  the  word 
recognition  process  will  require  evaluating  how  and  to  what  degree  each  level  of 
structure  is  encoded  in  an  orthography.  The  second  component  is  a  discussion  of 
what  I  see  as  a  major  implication  of  the  previous  argument.  That  is,  the  complexity 
of  orthographic  systems  indicates  that  learning  the  associations  necessary  for 
reading  involves  far  more  than  what  is  generally  provided  by  way  of  instruction. 
Hence,  reading  acquisition  involves  what,  in  many  of  the  fields  of  cognitive 


science,  is  called  implicit  learning  -  a  process  whereby  structured  information  is 

induced  from  a  complex  stimulus  environment,  largely  without  awareness  of  what 

exactly  is  learned  or  how. 

It  is  generally  acknowledged  that  learning  to  read  and  write  does  not 

proceed  automatically  (as  does  first  language  acquisition)  but  instead  requires 
explicit  instruction.  Further,  the  traditional  stuff  of  education  has  been  the 
measurement  of  explicit  learning  processes,  which  involve  the  selective  search  for 
information,  the  building  of  hypotheses  and  testing  them,  or  the  assimilation  of 
rules  following  explicit  instruction.  Since  teaching  intervention  is  known  to  be  a 
critical  factor  in  learning  to  read,  research  has  largely  been  devoted  to  developing 
theories  of  explicit  learning  -  phonics,  for  example.  The  notion  that  reading 
acquisition  alsojnvolves  a  large  degree  of  implicit  induction  (a  natural,  efficient 
cognitive  process)  has  major  theoretical  and  pedagogical  implications. 

Trends  in  Reading  Theory 
Theories  of  reading  generally  proceed  from  the  position  that  word 
recognition  involves  the  analysis  of  one  level  of  linguistic  structure.    Research  on 
how  words  are  best  learned  range  from  "phonic"  theories,  which  suggest  that  the 
graphic  system  is  a  code  for  the  phonological  system,  to  "whole  word"  theories, 
which  believe  that  words  are  learned  as  visual  gestalts.  Most  current  work  in 
reading  suggests  that  both  phonic  and  lexical-semantic  strategies  apply  in  learning 
to  read.  These  "dual  route"  theories  suggest  that  readers  can  read  words  using  one 
of  two  routes.  The  lexical  route  leads  directly  into  the  lexicon  and  is  accessed 
semantically.  The  phonic  route  leads  indirectly  to  the  lexicon,  with  readers  first 
applying  letter  sound  translation.  Interestingly,  most  dual  route  theories  posit  that 
these  routes  operate  independently  and  are  not  interactive.  So,  while  the 
frameworks  above  differ  with  respect  to  the  particular  access  route(s)  to  word 


recognition,  all  assume  the  reader  need  interpret  only  one  level  of  linguistic 
structure  in  order  to  recognize  words.  More  recently,  interactive  activation  models 
of  word  recognition  emphasize  that  word  recognition  is  a  single  interactive  process. 
Instead  of  alternate  recognition  pathways,  these  models  assume  that  access  of 
phonological  information  is  an  automatic  consequence  of  word  recognition. 

Because  theories  which  endorse  phonological  recoding  have  led  to  specific, 
testable  questions  about  reading,  there  has  been  a  great  deal  of  experimental 
research  in  this  area.  The  guiding  premise  of  this  view  is  that  the  relationship 
between  alphabetic  scripts  and  what  they  represent  is  primarily  phonological,  based 
on  phoneme-grapheme  correspondences.  Until  recently,  it  has  been  assumed  a 
priori  that  if  the  orthography  is  alphabetic,  the  representation  must  be  phonemic. 
There  is  a  great  deal  of  literature  and  experimental  research  which  examines  the 
relationship  between  reading  and  phonemes  wherein  there  is  considerable  evidence 
that  "phonological  awareness"  correlates  with  beginning  reading  success. 
Phonological  awareness  is  often  defined  as  the  ability  to  recognize  and  separate 
words  into  their  constituent  phonological  units.  Though  this  notion  may  be  subject 
to  different  interpretations*,  it  generally  refers  to  the  segmenting  and  blending  of 
phonemic  units  -  phonemic  awareness.  While  some  researchers  have  explored  the 
role  syllabic  (as  opposed  to  phonemic)  awareness  plays  in  reading  acquisition 
(Gleitman  and  Rozin  1973;  Treiman  1992;  Goswami  and  Bryant  1992),  the  greatest 
emphasis  is  put  on  phonemic  awareness  since  it  is  this  skill  that  has  been  most  often 
correlated  with  beginning  reading  (Adams  1990;  Byrne  and  Feilding-Barnsley 
1989,  1990,1993;  Ehri  and  Wilce  1980;  Ehri  1994;  Liberman,  Shankweiler 


Empirical  evidence  of  phonological  awareness  is  taken  from  experiments  involving  phonological 
awareness  tasks.  These  tasks  are  of  varying  complexity  and  difficulty  for  botli  children  and  adults 
The  easier  tasks  involve  tesung  knowledge  of  more  perceptually  salient  aspects  of  phonology- 
syllables-  rhyme  and  alliteration  tasks  and  syllable  counung.  Phonemic  awareness  or  phoneme 
discrimination  tasks  such  as  phoneme  counting/lapping,  phoneme  deletion  or  addition  are  much 
more  difficult.     Even  very  literate  adults  arc  not  uniformly  successful  with  phoneme  deletion 
tasks  (Scholes  and  Willis  1990),  making  unclear  the  degree  to  which  phonemic  awareness  results 
from  literacy  instruction. 


Liberman,  Fowler  and  Fisher  1977;  Morals,  Bertelson,  Cary  and  Algeria  1986; 
Stanovich,  Cunningham,  and  Cramer  1984 ).  Consequently,  it  is  widely  believed 
that  words  are  recognized  in  accordance  with  their  phonemic  structure  and  that 
beginning  reading  depends  on  awareness  of  that  structure.  Accordingly,  beginning 
readers  should  be  taught  to  analyze  strings  of  letter  to  sound  correspondences  and 
blend  them  together  to  form  words. 

Chapter  two  gives  an  overview  of  the  competing  reading  theories  and 
major  empirical  research  associated  with  them.  Two  relevant  points  arise  from  this. 
First,  reading  research  is  particularly  focused  on  how  readers  use  phonemic 
structured  And  second,  it  is  widely  believed  that  use  of  phonemic  structure  in 
reading  does  not  develop  automatically  but  must  be  explicitly  taught.  Accordingly, 
the  more  aware  readers  are  of  the  phonemic  correlation,  the  more  reading 
acquisition  should  be  facilitated.  I  intend  to  show  (in  following  chapters)  that  by 
focusing  primarily  on  phonemic  recoding,  reading  research  has  overlooked  some 
obvious  properties  of  alphabetic  writing  systems  and  has  largely  excluded  from 
investigation  the  processing  of  other  aspects  of  linguistic  structure.  Additionally, 
the  phonemic  perspective  endorses  the  notion  that  reading  acquisition  is  primarily 
an  explicit,  conscious,  goal  directed  learning  process,  a  perspective  that  this 
discussion  calls  into  question. 

The  Question  of  Representation:  Evidence  from 
Writing  Systems  and  Language  Processing 

What  I  believe  has  been  most  problematic  in  theories  of  reading  is  the 
ambiguity  about  what  in  the  graphic  system  represents  what  in  the  spoken  linguistic 
system.  The  premise  that  alphabetic  scripts  represent  phonological  systems  is 
fundamental  to  theories  of  phonological  recoding.  Ideally,  there  would  be  a  one-to- 
one  correspondence  between  graphemes  and  phonemes,  mirroring  the  language's 
3  This  is  due  to  the  fact  that  most  research  has  focused  on  reading  alphabetic  scripts. 


phonemic  system.  However,  owing  to  a  complex  of  factors  such  as  historical 
change,  dialect  variation,  intermittent  spelling  reform,  and  pressures  for  linguistic 
preservation,  most  alphabetic  scripts  fall  short  of  this  ideal.  Even  scripts  that  are 
quite  phonemically  regular  (e.g.  Dutch,  Italian)  deviate  from  this  ideal  in  many 
respects.  With  some  alphabetic  scripts  (e.g.,  English,  French),  direct  phoneme- 
grapheme  correspondence  is  minimal. 

Yet,  alphabetic  orthographies  do,  albeit  partially,  represent  phonology  at  the 
level  of  segments.  This  in  itself,  however,  is  an  inadequate  basis  for  concluding 
that  the  letter-phoneme  relationships  are  the  exclusive  means  of  representation  and 
processing.  Similarly,  there  is  no  reason  to  assume  that  lexical-semantic  word 
recognition  occurs  independently,  without  associations  to  smaller  linguistic  units 
present  in  the  orthography.  It  is  more  likely  that  the  letter-phoneme  connection  is 
but  one  aspect  of  the  representational  system  that  must  be  decoded  by  the  reader; 
the  lexical-semantic  connection  another.  It  is  likely  also  that  the  representational 
system  includes  linguistic  associations  between  the  phoneme  and  word  levels. 
Furthermore,  there  is  no  reason  that  word  recognition  should  rely  on  any  single 
pathway  at  a  time.    In  fact,  neurobiological  evidence  suggests  that  processing  any 
complex  visual  or  auditory  stimulus  relies  on  multiple  pathways  simultaneously 
(Galaburda  1996). 

I  believe  the  reasearch  focus  on  one  script  type  (alphabets)  has  resulted  in 
investigators  missing  what  alphabets  have  in  common  with  other  scripts.  Chapter 
three  discusses  the  variety  among  the  worlds  script  types  in  order  to  show  that 
among  diverse  scripts  there  are  some  striking  commonalities:  All  scripts  represent 
multiple  aspects  of  linguistic  structure,  and  they  are  all  partial  representations  of 
those  structures.  Having  illustrated  that  multilevel  representations  are  common 
across  various  scripts  and  script  types,  I  will  argue  that  processing  written  language 
(in  general)  is  much  more  complex  than  phoneme-grapheme  translation.  Instead, 


reading  involves  interpreting  multiple  levels  of  linguistic  structure,  and  also, 
extracting  information  that  is  not  overtly  encoded  as  linguistic  units. 

Because  discussions  of  how  readers  utilize  linguistic  structure  have  focused 
primarily  on  phonology,  specifically  phoneme-grapheme  correspondences,  there  is 
comparatively  little  focus  on  whether  readers  utilize  morphology  or  syllable 
structure.   This  oversight  is  striking  considering  the  central  role  that  the  constructs 
morpheme  and  syllable  play  in  theoretical  linguistic  accounts.  Chapter  four  will 
discuss  empirical  evidence  indicating  that  morphological  and  syllable  structure  is 
used  in  the  visual  processing  of  various  scripts.    Differences  in  the  way  people 
read  morphologically  complex  words  suggests  that  they  do  rely  on  morphological 
cues.  That  phonological  processing  involves  structures  other  than  phonemic 
suggests  that  readers  extract  a  variety  of  phonological  cues.  Further,  there  is 
considerable  evidence  that  readers  interpret  orthographic  regularities  --  statistical 
information  about  recurring  symbol  patterns  even  when  they  do  not  directly 
correspond  to  linguistic  units. 

In  examining  the  relationship  between  the  various  levels  of  linguistic 
representation,  orthographic  regularities,  and  word  recognition,  the  aim  is  to  gain 
insight  into  how  these  multiple  processes  affect  reading  and  reading  acquisition.    I 
suggest  that  commonalities  across  scripts  are  likely  to  be  a  result  of  human 
preferences  and  constraints  on  processing  visual  language.  Hence,  orthographies 
themselves  are  evidence  of  how  people  learn  to  read  and  such  evidence  can  and 
should  inform  reading  theory. 

Principles  oflmplicit  Learning 
The  larger  objective  of  this  study  is  to  bring  one  major  implication  of  the 
preceding  discussion  to  bear  on  reading  theory.  If,  as  I  am  arguing,  the  decoding 
process  involves  decoding  multiple  levels  of  structure,  then  there  is  much 


information  readers  learn  in  addition  to  what  they  are  explicitly  taught  and  indeed, 
even  before  they  are  explicitly  taught  decoding  strategies.  Hence,  in  addition  to 
(sometimes  in  spite  of)  what  readers  are  taught,  there  is  much  that  they  must 
implicitly  induce  in  order  to  learn  to  decode  an  orthographic  system  and  learn  to 
read.  If  various  levels  of  linguistic  structure  are  demonstrably  functional  in 
recognizing  words,  then  learners  must  analyze  much  of  the  structure  present  in  an 
orthography  without  explicit  instruction  or  explicit  knowledge  of  the 
representational  relationship  on  which  the  decoding  process  is  based.  So,  as 
readers  mature,  they  do  not  proceed  on  the  assumption  that  the  orthography  is 
phonemically  valid,  but  use  the  various  levels  of  structure  at  once  to  identify  lexical 
items.  Hence,  much  of  learning  to  read  can  be  thought  of  as  implicitlearninp.    In 
short,  most  of  us  learn  to  read  desp,  te  the  considerable  amount  of  information  that 
we  are  not  taught. 

The  term  implicit  learning  has  been  used  to  characterize  the  manner  in  which 
people  come  to  apprehend  complex  structured  information  from  a  stimulus 
environment,  where  acquisition  occurs  largely  independently  from  explicit 
knowledge  of  how  you  are  learning  and  what  exactly  you  have  learned.    Implicit 
learning  is  the  human  ability  to  derive  information  about  the  world  in  an 
unconscious,  non-reflective  way.  This  process  is  conceived  as  a  by-product  of  the 
application  of  attention  to  relevant  rule-governed  structures  in  the  environment,  be  it 
natural  language,  socially  prescribed  behaviors  or  of  complex  economic  systems 
(Winter  and  Reber  1994).  Hence,  implicit  learning  is  a  general  inductive  process 
that  derives  information  about  patterned  relationships  in  a  stimulus  environment, 
and  results  in  an  abstract4  and  tacit  knowledge  base  (Reber  1967,  1976,  1993; 
Berry  and  Dienes  1993;  Berry  and  Broadbent  1984,  1988;  Mathews  et  al.  1989; 
Mathews  1991;  Cleeremans  1993). 


4  The  issue  of  abstractaess  is  a  debated  one  which  will  be  discussed  further  in  chapter  5. 


This  is  not  to  suggest  that  complex  structures  cannot  be  learned  through 
conscious  analytical  strategies,  but  rather  that  implicit  operations  can  be  engaged  in 
the  analysis  of  structured  patterns,  even  when  these  regularities  have  not  been 
consciously  detected  (Winter  and  Reber  1994),  Furthermore,  implicit  mechanisms 
may  be  'better  at'  learning  structures  that  are  too  complex  for  conscious  analytical 
strategies.  That  is,  explicit  processes  are  resource  intensive,  requiring  explicit 
memory,  conscious  attention,  and  overtly  controlled  hypothesis  testing.  On  the 
other  hand,  implicit  mechanisms  are  automatic,  unconscious  and  as  such,  less 
resource  consuming,  thus  freeing  up  explicit  mechanisms  for  higher  order 
functions. 

Various  implicit  learning  paradigms  have  been  designed  to  experimentally 
explore  the  process  by  which  people  acquire  complex  knowledge  about  the  world, 
largely  independent  of  conscious  attempts  to  do  so.  Artificial  grammar  learning  is 
the  most  widely  replicated  paradigm.    In  theses  experiments,  subjects  learn 
complex  rule  governed  stimuli  generated  by  a  synthetic,  semantic  free,  Markovian, 
finite-state  grammar,  which  are  too  complex  to  be  learned  in  one  afternoon  in  the 
laboratory  (Reber  1993).    In  the  typical  study,  subjects  memorize  strings  of  letters 
in  the  synthetic  language  and  are  later  tested  for  their  knowledge  of  the  rules  of  its 
grammar  by  being  asked  to  make  decisions  concerning  the  well-formedness  of 
novel  strings  of  letters,  the  associations  of  which  are  determined  by  arbitrary  rules. 
The  empirical  findings  indicate  that,  when  the  underlying  structure  of  a  stimulus 
network  is  highly  abstract  and  complex,  subjects  become  quite  adept  at  judging  the 
grammaticality  of  letter  strings  thai  they  have  never  seen  before.  The  implication  is 
that  subjects  are  learning  abstract  regularities  of  the  stimulus  network. 

Although  implicit  learning  experiments  attempt  to  isolate  implicit  processes, 
the  literature  suggests  that  both  implicit  and  explicit  mechanisms  are  involved.  In 
general,  implicit  and  explicit  processes  should  be  viewed  as  cooperative  processes 
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with  interactive  components.  Furthermore,  these  processes  may  vary  in  their 
accessibility  to  conscious  awareness,  degree  of  generality,  and  degree  to  which 
these  processes  facilitate  each  other.  Implicit  and  explicit  mechanisms  operate  in 
parallel,  with  the  resultant  knowledge  being  a  synthesis  of  the  two  levels  of 
analysis.  Chapter  five  will  review  theories  and  empirical  research  with  implicit 
learning  phenomena,  with  the  intention  of  making  a  parallel  with  reading 
acquisition. 

Implicit  Learning  and  Reading  Acquisition: 
Processing  Commonalties 

It  is  widely  believed  that  the  prominent  characteristic  of  reading  acquisition 
that  distinguishes  it  from  learning  to  speak  is  that  reading  requires  conscious 
analysis  of  how  the  written  code  maps  on  to  spoken  language.  While  this 
observation  is  accurate  to  some  degree,  it  does  not  adequately  capture  the 
complexities  of  learning  to  read.  I  suggest  that  what  we  know  about  written 
language  processing,  the  commonalilies  among  writing  systems,  combined  with  the 
insights  of  implicit  learning  research,  permit  a  reconceptualization  of  the  process  of 
reading  acquisition.  I  believe  that  insight  into  the  interaction  between  explicit  and 
implicit  learning  operations,  provide  a  framework  for  this  conceptualization. 
Drawing  on  the  specifics  of  prior  chapters,  in  chapter  six,  I  will  argue  that  many 
of  the  circumstances  of  learning  to  read  parallel  that  of  an  implicit  learning  situation. 
My  goal  is  to  suggest  a  framework  whereby  learning  to  read  is  characterized  by  an 
interface  of  implicit  and  explicit  learning  processes,  which  operate  concurrently  in 
the  interest  of  efficient  processing. 

Practical  and  Theoretical  Implications 
Educators  rely  on  the  fact  that  learners  bring  crucial  implicitly  acquired 
knowledge  to  the  educational  stage.  Particularly  young  children  acquire  most  of 
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what  they  know  without  intention  to  leurn.  Yet,  at  the  same  time,  formal  education 
typically  proceeds  from  the  assumption  that  learning  complex  knowledge  is  an 
explicit,  goal-directed  process  where  information  is  presented  in  more-or-less 
factual  form,  and  subsequently  put  to  use.  It  is  interesting  that  while  formal 
educational  methods  rely  on  implicitly  acquired  knowledge,  just  how  implicit 
learning  plays  a  role  in  formal,  explicit  instruction  is  rarely  examined.  Instead, 
instruction  is  focused  on  learning  that  is  intentional  and  reasoned.  Consequently, 
formal  education  relies  on  the  dynamic  of  implicit  and  explicit  learning,  yet  tends  to 
restrict  its  scope  to  issues  of  explicit  instruction.  It  is  distinctively  the  interface  of 
implicit/explicit  learning  and  knowledge,  and  the  tendency  of  educators  and 
researchers  to  disregard  it,  that  has  resulted  in  many  of  reading's  contentious 
instructional  issues.  Moreover,  the  reconceptualization  of  reading  presented  here 
has  cross-disciplinary  theoretical  implications.  Chapter  seven  discusses  how  and 
why  the  proposed  framework  bears  on  both  specific  practical  concerns  and  general 
theoretical  questions. 

Concluding  Remarks 

This  study  is  predicated  on  the  assumption  that  the  literatures  in  reading, 
the  linguistic  study  of  writing  systems,  and  implicit  learning  can  be  usefully 
integrated  in  order  to  reconceptualize  the  fundamentals  of  learning  to  read.  I  believe 
that  there  is  considerable  insight  to  be  gained  from  this  synthesis  of  literatures  that 
are  not  often  joined.  In  turn,  an  analysis  of  reading  acquisition  from  this 
perspective  expands  the  literature  in  each  of  these  domains. 

Much  of  the  existing  research  on  learning  to  read  is  dominated  by  the 
following  assumptions:  alphabetic  letters  represent  phonemes  and  therefore,  the 
most  essential  part  of  learning  to  read  is  mastery  of  letter-phoneme  relationships. 
Also,  written  language  maps  on  to  spoken  language  in  ways  that  must  be 
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consciously  understood.  I  intend  to  show  that  both  of  these  premises  lead  to  an 
inadequate  characterization  of  reading,  and  so  are  in  need  of  revision. 

The  literature  in  implicit  learning  can  be  effectively  appropriated  in  order  to 
advance  our  understanding  of  reading  acquisition.  Within  the  disciplines  of 
psychology  and  computer  science,  there  is  abundant  evidence  that  implicit 
processes  are  efficient,  principled  cognitive  functions  that  are  very  good  at 
processing  certain  types  of  information.  On  the  basis  of  processing  evidence  and 
commonalties  across  scripts,  I  suggest  that  decoding  orthographic  representation  is 
one  of  these  types. 

A  natural  proclivity  for  processing  efficiency  is  the  basis  for  the  proposed 
conceptualization  of  the  learning  strategies  used  in  reading  acquisition.  The 
capacity  to  interpret  various  aspects  of  linguistic  structure  simultaneously  facilitates 
decoding,  making  the  reading  process  arguably  more  efficient  than  it  would 
otherwise  be.    Similarly,  employing  implicit  and  explicit  processes  to  a  learning 
task  is  a  matter  of  allocating  resources  where,  and  in  a  manner,  that  they  can  be 
most  effectively  used.  I  believe  that  an  understanding  the  dynamics  of  such 
learning  strategies  can  be  used  to  advance  discussions  of  reading  in  innovative 
ways. 


CHAPTER 2 
TRENDS  IN  READING  THEORIES  AND  PRACTICE 


Theories  of  reading  acquisition  which  attempt  to  specify  how  beginning 
readers  learn  to  recognize  words  are  numerous  and  elaborate  .  Teaching  practice 
and  educational  policy  are,  to  some  degree,  influenced  by  ongoing  research  in  this 
area.  This  chapter  will  review  the  dominant  trends1  in  reading  research  and  discuss 
how  they  have  influenced  teaching  practice  and  policy.  This  chapter  will  discuss 
the  strengths  and  weaknesses  of  various  reading  theories  and  instructional 
philosophies  in  order  to  demonstrate  that  each  captures  some  essential  property  of, 
or  insight  into,  the  reading  acquisition  process.  It  is  also  my  intention  to  point  out 
that  despite  the  quantity  of  reading  literature  and  the  insights  it  provides,  the  scope 
of  reading  research  has  been  sufficiently  narrow  such  that  it  has  not  given  due 
consideration  to  an  understanding  of  the  nature  of  human  learning  and  language. 
Bearing  in  mind  that  understanding  the  reading  process  must  include  an 
understanding  of  language  and  the  basic  operating  characteristics  of  the  human 
brain,  this  discussion  is  prelude  to  an  integration  of  theoretical  considerations  which 
are  largely  absent  from  the  reading  literature. 

Phonic  Theories  of  Reading 
Phonic  theories  are  premised  on  the  idea  that  alphabetic  orthographies 
represent  phonology  systematically.  Accordingly,  word  recognition  is  a  matter  of 
translating  the  orthography's  phonological  code  and  phonological  processing  is 

5  This  is  by  no  means  meant  lo  be  a  comprehensive  review  of  the  vast  amount  of  existing 
research.  Rather,  this  is  a  general  account  of  the  dominant  theoretical  trends  and  their 
contributions  to  understanding  reading  acquisition. 
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presumed  to  be  the  most  important  mechanism  in  learning  to  read  with  alphabetic 
scripts.  Basically,  theories  of  phonological  receding  suggest  that  children  learn  to 
read  by  converting  letters  into  sounds,  applying  phoneme-grapheme 
correspondence  rules,  and  then  recognizing  the  word  from  its  pronunciation 
(Liberman  et  al.  1977,  Gough  1972).    These  relationships  are  encoded  into  the 
lexicon  of  beginning  readers. 

Since  alphabetic  systems  are  thought  to  represent  phonological  systems,  it 
is  believed  that  reading  acquisition  hinges  on  what  has  been  widely  called 
phonological  awareness  --  generally,  the  ability  to  break  words  up  into  their 
constituent  phonological  units.   There  is  some  confusion  about  this  term  in  the 
literature6.  It  is  sometimes  used  to  mean  conscious  analysis  of  speech  at  the  level  of 
phonemic  segments.  Other  times  it  is  used  to  mean  a  general  perceptiveness  of  the 
units  of  speech,  not  necessarily  phonemic.  Empirical  evidence  of  phonological 
awareness  is  taken  from  experiments  that  involve  tasks  of  varying  complexity  and 
difficulty  among  children  and  adults.  The  easier  tasks  involve  testing  for 
knowledge  of  more  perceptually  salient  aspects  of  phonology  such  as  syllables. 
Evidence  of  syllable  awareness  comes  from  rhyme  and  alliteration  tasks  and 
syllable  counting.  Preschool  children  (Maclean  et  al.  1987;  Trciman  and  Zukowski 
1991)  as  well  as  nonliterate  adults  (Morais  et  al.  1986)  have  been  shown  to  be 
successful  with  syllable  awareness  tasks.  Phonemic  awareness  tasks  try  to  assess  a 
persons  knowledge  of  segments,  which  are  much  less  perceptually  salient  speech 
units.  These  tasks,  such  as  phoneme  counting  or  tapping,  phoneme  deletion  or 
addition,  are  much  more  difficult.    Usually,  children  begin  to  acquire  this  skill  as 

"Stanovich  (1992)  suggests  lhat  the  generic  term  phonological  sensitivity  be  used  to  cover  the  set 
ol  processing  constructs  being  tapped  by  the  various  tasks  used  in  research    Phonological 
sensitivity  should  be  viewed  as  a  continuum  ranging  from  deep  sensitivity  to  shallow  sensitivity 

rf^ft  STUV7  T  ,d  hKlC,haraCterized  boU1  ^  Ule  ****  to  &™  conscious,  explicit  reports' 
of  speech  units  and  also  the  ability  to  detect  less  perceptually  salient  units  like  phonemic 
segments.  Hence,  phoneme  segmentation  would  fall  al  the  deep  end  of  the  continuum  while 
syllabic  counting  would  be  al  the  shallow  end. 
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they  learn  to  read  and  spell.  However,  even  very  literate  adults  are  not  uniformly 
successful  with  some  phoneme  awareness  tasks  (Scholes  and  Willis  1990). 

Indeed,  there  is  strong  correlational  evidence  that  some  type  of  phonological 
awareness  is  a  key  factor  in  learning  to  read  with  alphabetic  scripts.    However,  it 
is  phonemic/segmental  awareness  that  is  most  commonly  associated  with  beginning 
reading  success  (Adams  1990;  Byrne  and  Feilding-Barnsley  1989,  1990,1993; 
Ehri  and  Wilce  1980;  Ehri  1994;  Iverson  and  Tumner  1993;  Morais  et  al.  1986; 
Stanovich,  Cunningham,  and  Cramer  1984 ).   The  significance  of  this  correlation 
has  been  widely  cited  to  support  phonics  instruction  as  a  pedagogical  strategy,  at 
least  among  beginning  readers.    Yet,  even  this  strong  connection  between  reading 
and  phonological  awareness  appears  to  be  only  part  of  the  story.  Although  children 
with  low  phonemic  awareness  skills  are  more  likely  to  lag  behind  in  beginning 
reading,  not  all  children  with  good  phonemic  segmentation  abilities  become  good 
readers.  Hence,  it  appears  that  phonemic  awareness  is  not  the  only  condition 
necessary  for  efficient  early  reading  (Juel,  Griffith  and  Gough  1986). 

While  it  is  evident  that  beginning  reading  involves  phonological  processes, 
phonic  theories  of  reading  leave  many  apparent  components  of  reading  acquisition 
unclarified.   Perhaps  the  most  obvious  oversight  of  phonological  recoding  theories 
is  that  it  is  assumed  that  alphabets  represent  phonemic  segments,  and  that  the 
visual-phonological  code  is  based  on  letter-sound  translations.  This  view  has  been 
supported  by  the  positive  correlation  between  phonemic  awareness  and  reading. 
Yet,  at  the  same  time,  this  view  has  been  very  problematic  for  writing  systems  like 
English,  for  which  letter-sound  translations  involve  a  complex  of  rules  and 
numerous  "exceptions."    This  fact  has  led  some  researchers  to  reconsider  the 
assumption  that  alphabetic  representation  is  not  strictly  phonemic.  For  instance, 
Goswami  (1986,  1988),  Treiman  (1992),  and  others,  have  found  that  readers  are 
sensitive  to  syllable  structure  alphabetic  scripts.  In  fact,  if  you  consider  the 
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phonological  representation  in  English  script  to  include  word  analysis  into  onset 
and  rime,  the  number  of  phonemic  rules  and  exceptions  are  dramatically  reduced 
(Tretman  1992).  Following  this  line  of  reasoning,  we  must  consider  other  levels  of 
representation  possible  with  alphabetic  orthographies:  morphophonemic, 
morphemic,  etc. 

Another  issue  of  some  difficulty  to  phonological  recoding  theories  is  that  the 
extent  of  phonological  awareness  necessary  to  facilitate  recoding  is  unclear. 
Moreover,  it  is  unclear  whether  or  to  what  degree  phonological  awareness  need  be 
explicitly  taught.  It  is  often  argued  that  phonemic  awareness  only  develops  with 
explicit  instruction  or  acquisition  of  an  alphabetic  script  (Adams  1 990),  and  that  the 
"alphabetic  principle"  must  be  consciously  understood  to  be  used  properly 
(Liberman  and  Liberman  1992).  This  premise  has  provided  a  strong  rationale  for 
phonics  instruction.  Accordingly,  the  more  aware  readers  are  of  the  phonemic 
correlation,  the  more  reading  acquisition  should  be  facilitated.  Yet,  even  for  those 
who  endorse  this  view,  the  degree  of  phonemic  awareness  necessary  to  facilitate 
beginning  reading,  and  the  degree  of  instruction  necessary  to  facilitate  phonemic 
awareness,  are  matters  of  debate.  For  instance,  some  researchers  suggest  that 
phonemic  awareness  can  develop  spontaneously  in  learning  to  read  and  spell  a  large 
number  of  words.  In  this  view,  readers  implicitly  internalize  how  sound  maps  on 
to  printed  words.  Hence,  phonological  awareness  is  consequent  to  reading, 
regardless  of  instructional  method  (Gough  and  Hillinger  1980;  Gough,  Juel  and 
Griffith  1992).     Also,  many  researchers  regard  phonemic  awareness  as  much  a 
consequent  as  a  facilitator  to  learning  to  read  (Bradley  and  Bryant  1983;  Ehri  1984, 
1994;  Mann  1987;  Perfetti  1985;  Perfetti  et  al.  1987;  Morais  et  al.  1986;  Prakash  et 
al.  1994).  This  bi-directional  language  awareness  phenomena  makes  it  difficult  to 
assess  the  degree  to  which  phonemic  awareness  is  necessary  for  beginning  reading. 
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Finally,  there  is  the  issue  that  successful  readers  who  are  taught  to  recode 
letters  to  phonemes  use  this  strategy  for  only  a  brief  period.    Beginning  letter-by- 
letter  readers  decode  words  at  the  rate  of  1  letter  per  second.  Skilled  readers 
process  approximately  100  letters  per  second  .  This  means  that,  with  practice, 
recoding  becomes  an  automatic  process  -  a  permanent  set  of  associative 
connections  in  long-term  memory  that  develops  with  practice  (Schneider  and 
Shiffrin  1977).  What  is  problematic  in  phonological  recoding  theories  is  that  it  is 
unclear  whether  these  automatic  connections  operate  at  the  phonemic  level  or  at 
some  other  level,  such  as  the  word.  If  automatic  connections  are  to  word-sized 
units,  then  phonological  processing  may  be  may  be  used  only  in  the  very  early 
stages  of  reading  acquisition.  This  issue,  along  with  others  discussed  above  have 
led  to  competing  theories  of  reading. 

Whole-  Word  Theories 
An  alternative  view  of  reading  is  that  the  representational  system  is 
lexical/semantic.  Readers  learn  associations  between  visual  forms  of  whole  words 
and  their  semantic  referents.  Insight  reading,  or  whole-word  reading  theories, 
words  are  stored  in  memory  as  visual  gestalts  as  a  result  of  much  practice  reading 
the  words  (Smith  1972).  Readers  recognize  sight  words  without  attending  to  the 
individual  letters  or  segmenting  the  words  into  their  component  sounds.   This 
theory  presumes  that  even  children  who  are  explicitly  taught  phonemic  recoding 
rapidly  abandon  that  strategy  in  favor  of  identifying  whole  words. 

One  source  of  evidence  taken  in  support  of  whole-word  theories  comes 
from  research  on  eye  movements  in  reading.  Readers  make  a  series  of  rapid  eye 
movements  called  saccades,  separated  by  fixations]  pauses  that  last  about  100-250 
msec  each.  Virtually  all  new  information  is  extracted  from  the  text  during  these 
fixational  pauses.  The  function  of  the  saccade  is  to  move  the  eyes  rapidly  from  one 
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position  to  another,  with  the  average  saccade  length  being  about  7-9  characters,  or  a 
bit  over  one  word  (Rayner  and  Pollalsek  1987:  239).  Additionally,  experiments 
using  a  tachistope  -  equipment  that  can  present  information  to  the  eyes  for  a  very 
brief  period  -  have  shown  that  readers  can  identify  words  as  easily  and  quickly  as 
random  letters  (Kolers  and  Katzman  1966;  Newman,  1966  in  Smith  1988).  It  can 
be  argued  that  if  words  are  just  as  easily  identified  as  letters,  then  readers  must 
perceive  words  as  units,  just  as  they  do  letters.  Also,  if  words  can  be  identified  just 
as  quickly  as  letters,  then  recognition  need  not  involve  letter  by  letter  analysis. 
Further  evidence  for  the  sight  reading  perspective  comes  from  studies 
which  conclude  that  phonological  recoding  has  limited  application  to  learning  the 
graphic  representations  of  words  because  it  is  a  strategy  employed  only  in  short- 
term  memory.  There  is  evidence  that  phonic  recoding  occurs  only  if  the 
orthographic  material  is  to  be  temporarily  remembered  (Mattingly  1984),  but  that 
long-term  memory  is  semantically  structured.  Therefore,  words  must  be 
represented  in  memory  as  whole  units. 

Whole-word  theories  do  reflect  the  intuitive  notion  that  we  read  words 
rather  than  strings  of  letters,  however,  they  also  have  some  apparent  weaknesses. 
Evidence  collected  to  support  this  view  does  not  explain  how  words  are  recognized 
as  wholes.  Nor  does  it  explain  what  words  "look  like"  to  the  reader.  That  is,  if 
there  is  no  reliance  on  letters,  then  how  are  word  shapes  are  defined?  Furthermore, 
there  is  abundant  empirical  evidence  that  phonological  processes  of  various  types 
are  involved  in  reading  for  both  children  and  adults. 
Dual-Route  Theories 
Despite  the  inadequacies  of  whole-word  theories,  and  despite  the  evidence 
that  reading  involves  some  kind  of  phonological  processing,  everyone  agrees  that 
skilled  readers  process  written  language  much  too  fast  to  be  performing  serial  letter- 
to-sound  translations.    Dual-route  theones  have  attempted  to  accommodate  these 
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facts  by  integrating  both  phonological  processing  and  sight-word  analysis  (Barron 
and  Baron  1977;  Baron  1977;  Coltheart  et  al.  1977;  Bryant  and  Bradley  1980;  Frith 
1979).  These  theories  suggest  that  readers  can  read  words  using  one  of  two  routes. 
The  lexical  route  leads  directly  into  the  lexicon  and  is  accessed  semantically.  The 
phonic  route  leads  indirectly  to  the  lexicon,  with  readers  first  applying  letter  sound 
translation.  Most  dual-route  theories  posit  that  these  routes  operate  independently. 
Or,  if  both  routes  operate  in  parallel,  the  fastest  process  will  yield  recognition.  In 
either  case,  the  pathways  are  separate,  not  connected.  For  words  that  are 
previously  learned  and  stored  in  memory,  the  visual-semantic  route  is  faster 
because  it  does  not  require  phonological  mediation.   For  unfamiliar  or  difficult 
words,  the  phonological  route  would  be  engaged  first. 

When  words  are  recognized  via  the  visual-semantic  route,  their  visual  forms 
are  associated  with  the  word's  meaning  which  has  been  stored  in  memory.  The 
visual  form  itself  may  be  recognized  in  terms  of  spatial  cues  that  may  take  the  form 
of  permissible  orthographic  sequences,  sequence  length,  boundary  letters,  etc.  The 
associations  are  made  through  repeated  contact  with  words  until  they  are  stored  in 
memory.  Once  stored  in  memory,  word  recognition  is  immediate,  via  directaccess. 
However,  these  visual  sequences  are  word  specific  and  they  do  not  represent 
linguistic  structure  below  the  word  level.  This  means  that  any  orthographic 
systematicity  within  an  individual  word  is  not  relevant  to  identifying  other  words. 

The  phonological  route  converts  words  into  their  pronunciations  by 
applying  receding  rules.  The  pronunciations  are  then  used  to  access  the 
semantically  stored  words.  Although  receding  improves  with  practice,  it  is  never 
as  fast  as  whole-word  access.  Further,  the  letter-to-sound  correspondences  for 
particular  words  are  not  part  of  the  reader's  memory  representation  for  those 
words.  For  those  who  have  been  taught  to  read  by  phoneme-letter  analysis, 
phonological  recoding  is  primarily  an  initial  strategy  that  is  bypassed  when  they 
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have  developed  automaticity  in  word  identification.   Hence,  when  the  words 
readers  have  seen  many  times  before  become  stored  in  memory,  phonological 
receding  is  unnecessary  and  not  used  (Gough  1984;  Barron  and  Baron  1977; 
Singer  1984).  In  fact,  the  difference  between  good  and  poor  readers  is  often  a 
question  of  moving  beyond  letter  by  letter  analysis  and  developing  automaticity  in 
word  recognition.  For  skilled  readers,  phonological  recoding  is  thought  to  be  used 
mainly  for  unfamiliar  or  difficult  or  nonsense  words  that  have  not  been  stored  in 
memory. 

In  effect,  dual-route  theories  posit  that  readers  will  use  both  the  semantic 
and  phonological  routes  to  access  words  regardless  of  how  they  are  taught  to  read. 
Beginning  readers  who  have  learned  to  read  words  by  sight  start  out  memorizing 
whole-words  later  add  phonological  skills  to  the  reading  task.  Phonics  trained 
readers  may  have  superior  recoding  skills  at  first  but  whole-word  readers  also  learn 
phonological  relationships  as  their  written  vocabulary  grows.  Research  has 
indicated  that  readers  who  are  not  taught  phonics  rules  eventually  induce  letter- 
sound  relationships  implicitly  as  they  practice  reading,  and  as  they  learn  to  spell 
(Gough  and  Hillinger  1980;  Ehri  and  Wilce  1987;  Gough,  Juel  and  Griffith  1992; 
Perfetti  1992;Treiman  1993). 

The  dual-route  models  capture  the  notion  that  letter-by  letter  analysis  is  not 
always  prerequisite  to  word  recognition.  It  also  captures  the  fact  that  recoding  is  a 
more  time  consuming  process  than  directly  accessing  whole  words.  Yet,  one 
fundamental  difficulty  with  dual-route  theories  (also  with  whole-word  theories)  is 
that  sight-word  recognition  does  not  rely  on  systematic  patterning  across  words. 
Consistent  phonological  and  orthographic  patterns  do  not  influence  the  direct  access 
of  words  because  each  word  is  stored  as  an  individual  unit.    Since  a  basic  fact  of 
human  cognition  is  that  we  exploit  the  structure  available  to  us,  it  is  highly  unlikely 
that  readers  would  or  could  overlook  patterned  structure  in  the  stimuli. 
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Another  basic  problem  is  that  the  semantic  and  phonological  pathways  are 
not  interactive.  Except  for  difficult  or  unfamiliar  words,  the  phonological  pathway 
should  be  bypassed  completely  for  skilled  readers.  Yet,  skilled  readers  have  been 
found  to  be  sensitive  to  the  phonological  constituents  of  words  (Treiman  and 
Zukowski  1988;  Feldman  1987;  Treiman,  Goswami  and  Bruck  1990).  Ehri  (1992) 
addresses  this  problem  by  suggesting  that  there  are  connections  between  the  visual 
(whole  word)  route  and  the  phonological  route.  In  this  theory  there  are  connections 
in  memory  between  a  word,  its  sequence  of  letters  in  spelling,  and  its  phonemes  in 
pronunciation.  Thus,  learning  new  words  involves  making  systematic 
phonological  connections  and  storing  the  phonological  information  in  memory  with 
the  semantic  information. 

A  third  difficulty  has  to  do  with  specifying  only  two  pathways:  phonemic 
and  semantic.  Phonological  recoding  is  thought  to  rely  on  a  system  of  rules 
whereby  a  visual  word  is  translated  into  its  phonological  code.  However,  it  has 
proved  immensely  difficult  to  specify  the  rules  governing  phoneme-grapheme 
correspondences.  This  undoubtedly  has  to  do  with  the  fact  that  orthographic 
systems  are  more  than  just  a  set  of  rules  that  map  phonemes  to  letters.  Yet, 
linguistic  associations  above  the  phonemic  level,  and  below  the  word  level,  are  not 
part  of  the  word  representation  in  dual-route  theories.  I  will  address  at  length  the 
idea  that  orthographies  encode  several  types  of  structure  beyond  the  phoneme  in 
chapters  3  and  4. 

Interacti  ve  Activation  Theories 
Interactive  parallel  activation  theories  of  reading,  which  have  developed 
from  interactive  theories  of  speech  perception,  try  to  adjust  some  of  the 
shortcomings  of  dual-route  theories  (McClelland  and  Rumelhart  1981 ;  Seidenberg 
1985,  1987).  One  innovation  of  this  approach  to  reading  is  that  it  emphasizes  that 


word  recognition  is  a  single  interactive  process.  Instead  of  alternate  recognition 
pathways,  these  models  assume  that  access  of  phonological  information  is  an 
automatic  consequence  of  word  recognition.  Recognition  is  initiated  by  extracting 
visual  information  from  the  input  where  readers  do  rely  on  systematic  patenting 
across  words.  As  salient  orthographic  units  are  recognized,  they  activate 
phonological  representations.  Consequently,  phonological  access  lags  behind 
visual  analysis.  Direct  access  occurs  when  there  is  enough  orthographic 
information  to  recognize  words  prior  to  phonological  access.  This  is  the  case  for  a 
large  number  of  common  words.    However,  phonology  is  still  activated  for  words 
retrieved  via  direct  access,  though  it  is  after  recognition,  making  phonology  a 
necessary  consequence  of  recognition  . 

Interactive  processing  theories  posit  that  readers  do  not  rely  on  a  set  of 
explicit  rules  to  make  these  systematic  associations  (e.g.,  rules  of  phoneme- 
grapheme  translation).  Alternatively,  specifying  the  processing  system  and  the 
knowledge  encoded  therein  is  a  matter  of  specifying  this  pattern  of  connectivity 
among  the  processing  units.  There  is  no  storage  bin  of  lexical  items,  the  "lexicon" 
is  in  the  connections  themselves;  a  map(s)  of  associations.  The  structural 
representations  are  phonological  and  semantic.  The  phonological  representations 
are  orthographic  patterns  of  associations.  The  semantic  representations  are  the 
meanings  of  units.  The  connections  between  the  structural  representations  will 
form  "maps,"  which  will  have  similar  items  located  near  each  other.  The  maps  will 
have  densely  interconnected  associative  connections.  Output  representation  is 
obtained  from  the  weight  vector  of  the  most  highly  active  unit.  Maps  and  the 
connections  between  them  are  organized  simultaneously,  based  on  examples  of  co- 
occurring  symbols  and  meanings.  (Seidenberg  1989,  1992,  Seidenberg  and 
McCLelland  1989). 
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The  way  an  interactive  activation  model  applies  to  reading  is  as  follows. 
The  template  (visual  image)  broadcasts  activation  to  words  in  the  lexicon 
simultaneously,  with  each  feature  of  the  template  contributing  a  small  amount  of 
activation  to  each  word  that  has  that  same  feature.  Many  words  will  receive  some 
amount  of  activation,  but  the  word  that  has  more  than  any  other  is  eventually 
accessed.  High  frequency  items  need  less  activation  than  low  frequency  ones  to  be 
accessed.  Each  word  unit  stands  for  a  particular  word  and  has  connections  to  each 
letter  in  the  word.  While  there  are  no  separate  units  for  regularly  occurring  letter 
sequences,  all  systematically  occurring  sequences  are  represented  as  maps  of 
associations,  and  hence,  are  an  intrinsic  part  of  the  lexicon.   The  structural 
representations  are  commonly  thought  to  be  phonological  and  semantic. 
Theoretically,  however,  since  activation  will  form  associative  maps  for  all 
systematic,  reoccurring  sequences,  morphemic  and  morphophonemic  patterns  (if 
present  in  the  visual  stimulus)  would  be  associated  via  the  same  process. 

Because  interactive  activation  models  rely  on  identifying  letter  sequences 
that  recur  in  a  systematic  way,  what  are  sometimes  called  analogy  models  fall  into 
this  theoretical  grouping.    That  is,  people  read  words  by  activating  known  words 
that  have  letter  sequences  in  common  with  the  stimulus.  For  instance,  phonological 
representations  are  associated  with  chunks  of  letter  strings.  So,  an  unfamiliar  letter 
string  like  void  will  activate  a  set  of  familiar  words  like  paid,  maid,  said  and 
vain.  Various  analogy  models  have  been  suggested.  Glushko  (1979,  1981), 
Goswami  (1986,1988),  Goswami  and  Bryant  (1992),  Treiman  (1992)  suggest  that 
letter  chunks  represent  syllabic  constituents.  So,  when  analyzing  an  unknown 
word  (or  nonsense  word)  void,  there  is  a  preference  for  readers  to  segment  the 
word  into  v  aid,  the  syllabic  constituents  of  onset  and  rime,  rather  than  vai  d.    In 
this  view,  letter  chunks  that  form  linguistic  units  are  more  important  than  those  that 
do  not.  This  is  true  because  humans  are  predisposed  to  segment  the  visual  stimulus 
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into  the  natural  constituents  of  speech.  Correspondingly,  I  would  add  also  that  this 
is  true  because  scripts  themselves  represent  linguistic  units. 

The  interactive  models  of  Seidenberg  and  McClelland  (1989)  and 
Seidenberg  (1987,  1989,  1992)  are  similar  to  the  analogy  models  above,  except  for 
one  point.  The  formation  of  letter  chunks  is  based  simply  on  distributional  letter 
properties,  not  on  any  perceptual  preference  for  organizing  the  stimulus  into 
linguistic  units.  Sequence  chunks  are  based  on  the  frequency  with  which  strings 
occur  in  the  visual  stimulus  and  the  consistency  of  their  pronunciation  only. 
Syllable  and  morphological  structure  is  not  directly  represented  but  rather  falls  out 
of  distributional  probabilities  of  letter  sequences.   Accordingly,  the  preference  for 
chunking  the  orthographic  sequence  void  as  v  aid  over  vai  d  is  simply  because 
the  former  occurs  more  frequently  with  consistent  pronunciation.  Hence,  the 
distributional  properties  of  letter  patterns  will  activate  certain  sequences  more  than 
others.  Letter  sequences  will  correspond  to  linguistic  units  below  the  word  level 
butnot  necessarily  because  people  are  naturally  disposed  to  segment  written 
language  into  speech  components  like  onset  and  rime. 

I  believe  that  the  Seidenberg  and  McClelland  view  is  trying  to  emphasize 
that  people  will  extract  whatever  regularities  are  present  in  the  script.  It  is  true,  for 
example,  that  readers  will  extract  particular  linguistic  structures  like  onsets  and 
rimes  only  where  they  are  consistently  represented.    Yet,  there  is  a  difficulty  with 
suggesting  that  choosing  particular  sequences  over  others  is  not  a  result  of 
perceptual  preferences.    Such  a  suggestion  obviates  the  reason  why  distributional 
letter  properties  reflect  linguistic  structures  and  cues  in  the  first  place.   Following 
this  logic,  there  should  be  no  reason  that  a  syllab.c  organization  into  onset  and  nme 
is  preferable  to  chunking  arbitrary  letter  patterns  (e.g.  there  would  be  no  reason  for 
v  -aid   to  occur  with  more  consistent  pronunciation  than  vai-  d,  or  va-id). 
Clearly,  however,  certain  sequences  occur  more  frequently  and  with  more 
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consistent  pronunciation  than  others  because  they  reflect  some  property  of  the 
spoken  language.  Hence,  readers  are  able  make  associations  between  orthographic 
form  and  their  linguistic  knowledge.  The  fact  that  orthographies  all  represent 
linguistic  structures  suggests  that  there  is  a  processing  advantage  for  readers  to 
chunk  the  visual  stimulus  into  units  that  correspond  to  those  of  spoken  language.  If 
there  were  no  processing  advantage  for  chunking  the  visual  stimulus  into  linguistic 
units,  we  should  be  just  as  likely  to  see  orthographic  systems,  say  an  alphabet, 
whose  distributional  letter  patterns  were  largely  arbitrary.  However,  this  is  never 
the  case. 

Reading  and  Morphology 
Because  most  reading  theories  have  focused  on  the  phonological  and  lexical 
aspects  of  visual  language  processing,  attention  to  the  morphological  properties  of 
reading  has  been  lacking.  This  is  due  in  large  part  to  the  fact  that  most  research 
deals  with  alphabetic  systems,  English  in  particular,  which  are  presumed  to  involve 
phonological  and/or  lexical  processing.  The  one  exception  to  this  rule  is  interactive 
activation  theories,  which  (at  least  in  principle)  can  include  morphological 
processing  in  reading,   However,  there  have  been  empirical  studies  which  have  led 
some  researchers  to  conclude  that  morphological  structure  is  indeed  used  to 
recognize  words  (Laudanna  et  al.  1989;  Feldman  1987;  Benton  and  Frost  1995; 
Chialant  and  Caramazza  1995;  Stoltz  and  Feldman  1995).  I  will  review  this 
research  in  chapter  4. 

Teaching  Philosophies: 
Code  versus  Whole  Language  Approaches  to  Reading  Acquisition 

In  the  above  review  of  reading  theories,  I  have  mainly  addressed  issues  of 

how  visual  language  is  processed  by  the  learner/reader.  Reading  research,  to  some 

degree,  has  had  an  impact  on  educational  policy.  In  this  section,  I  will  discuss  the 
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dominant  schools  of  thought  which  underlie  teaching  methods,  and  how  those 
methods  are  informed  by  reading  research.  I  do  not  here  presume  to  delineate  all 
the  types  of  early  reading  teaching  strategies,  which  undoubtedly  are  many  and 
eclectic.    However,  there  are  distinct  schools  of  thought  which  make  essentially 
different  assumptions  about  the  learning  process.  As  a  matter  of  comparison,  I  will 
present  the  poles  of  these  views. 

The  first  is  what  I  will  call  the  code  '  approach.  In  this  view,  learning  to 
read  involves  interpreting  a  fairly  complex  visual  system  and  analyzing  how  that 
system  maps  onto  natural  language.  The  code  approach  to  reading  has  been  d,rectly 
influenced  by  theories  of  phonological  receding  and  research  which  points  to  a 
correlation  between  phonological  awareness  and  reading  success.  Hence,  the 
principal  goal  is  teaching  beginning  readers  the  "alphabetic  principle"  -  that  letters 
signify  sounds  and  sounds  combine  to  form  words.  Phonics,  the  study  of  the 
relationship  between  the  sounds  in  spoken  words  and  the  letters  in  written  words, 
is  a  standard  tool  for  the  code  approach  to  reading.  A  distinguishing  characteristic 
of  the  code  approach  is  that  it  assumes  that  the  code  must  be  explicitly  taught  to  new 
readers.  Accordingly,  learning  the  code  involves  conscious,  directed  analysis  of 
letter-sound  relations  and  conscious  understanding  of  the  rules  that  govern  them. 

The  other  teaching  philosophy  which  enjoys  wide  support  is  called  whole- 
language*.  The  whole-language  approach  to  teaching  reading  is  predicated  on  the 
belief  that  comprehension  and  communication  are  pivotal  to  the  learning  process. 
Hence,  children  should  work  with  meaningful  materials,  and  that  they  read  and 
write  about  topics  that  interest  them.  In  this  view  (Goodman  1986),  children  are 


7  This  term,  popularized  by  Chall  1967,  has  been  widely  used  as  a  general  label  for  a  type  of 
readmg  instrucuon  that  emphasizes  that  written  language  is  a  code  that  children  must  learn  in  order 

«  This  is  not  to  be  confused  with  the  whole-word  theory  of  word  recognition,  which  is  a  theory  of 

k  nThT ^TT  """'I  fEUage-  "***"*'* is  a  Philosophy  of  teaching,  whose  scope 
a  much  broader  than  amply  begmnmg  readmg.  I  will  however,  discuss  it  as  a  teaching  method 
only  in  regard  to  early  reading  acquisition. 


27 


are 
or 


good  at  understanding  and  producing  language  when  they  need  to  express 
themselves.  Hence,  given  an  environment  where  the  input  is  meaningful  and 
purposeful,  children  should  be  just  as  good  at  learning  written  language  as  they 
at  learning  spoken  language.  Meaningful  materials  include  children's  literature 
stories  generated  by  the  children  themselves.  A  corollary  is  that  children  should 
never  be  forced  to  read  parts  of  text  (words)  in  isolation  or  out  of  context  since  this 
would  distort  natural  language  patterns  and  inhibit  the  communication  of  meaning. 
Advocates  of  whole-language  do  not  endorse  teaching  the  relation  between  sounds 
and  letters  or  memorization  of  spelling  rules.  In  fact,  in  this  view,  the  teaching  of 
phonics  could  be  harmful  because  it  distorts  language  into  small,  abstract  pieces, 
which  can  interfere  with  the  goal  of  communication.  Instead,  children  should 
figure  out  sound-letter  relationships  on  their  own  from  their  experience  with  printed 
and  spoken  words.  Analysis  of  sounds  in  relation  to  print  should  only  be 
encouraged  when  the  teacher  observes  children  have  already  made  the  connections 
on  their  own.  Hence,  each  child  is  an  "autonomous"  learner,  while  teachers  and 
parents  serve  as  "guides"  and  "facilitators,"  making  sure  the  children  are  presented 
with  meaningful  text. 

Strengths  and  Weaknesses  of  the  Code  Approach 

The  code  emphasis  is  bolstered  by  a  great  deal  of  experimental  data  which 
point  to  a  strong  and  consistent  relationship  between  phonological  awareness  and 
reading.  Yet,  even  though  there  is  little  doubt  that  phonological  awareness  plays  a 
role  in  reading,  it  is  far  less  clear  how  children  learn  the  relationships  between 
sound  and  print  Whether  children  must  be  taught  such  relationships  or  figure  them 
out  on  their  own  is  far  from  resolved.  While  it  has  been  shown  that  children  can 
benefit  from  explicit  phonemic  awareness  training  and  instruction  in  letter-sound 
relationships  (Adams  1990;  Bradley  and  Bryant  1983;  Byrne  and  Feilding- 
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Barnsley  1990,  Byrne  1992),  it  is  also  true  that  normal  readers  become 
phonologically  aware  as  they  learn  to  read  and  spell,  regardless  of  the  type  of 
instruction  they  receive  (Gough  and  Hillinger  1980;  Gough,  Juel  and  Griffith  1990; 
Perfett.  et  al  1987;  Treiman  1993).  Moreover,  since  reading  and  phonological 
awareness  seem  to  reciprocally  facilitate  each  other,  this  is  a  difficult  relationship  to 
sort  out. 

One  incontrovertible  fact  in  support  of  the  code  emphasis  is  that  written 
language,  across  scripts  and  script  types,  does  indeed  systematically  represent 
spoken  language.  It  is  a  code  for  speech.  Of  course,  successfully  teaching  "the 
code"  to  beginning  readers  relies  on  a  clear  understanding  of  what  the  code  is.  It  is 
this  understanding,  or  rather,  lack  of  it,  which  I  will  argue  often  presents 
difficulties.  Presently,  the  code  approach  is  primarily  influenced  by  theories  of 
phonemic  receding.  Consider  English,  where  letter-to-phoneme  correspondences 
do  not  account  for  many  words,  or  account  for  them  only  partially.  The  rules  of 
phonics  are  many  and  complex  and  have  many  exceptions.  A  careful  consideration 
of  the  facts  of  English  orthography  suggests  that  letter-to-phoneme  translations,  and 
the  rules  that  govern  them,  are  only  part  of  the  code,  only  a  fraction  of  the  system 
that  the  reader  must  internals.  I  suggest  that  interpreting  the  orthographic  code 
involves  deciphering  linguistic  patterns  beyond  the  phoneme:  syllable  constituents, 
morphophonemic  patterns,  lexical  and  functional  morphemes,  and  so  on,  and  that 
this  is  the  rule  across  scripts  and  script  types. 

An  additional  and  very  important  point  is  that,  if  we  accept  that  the  code  is 
only  partly  a  phonemic  one,  then  the  many  phonics  rules  which  try  to  force  the  code 
to  be  strictly  phonemic,  are  not  likely  to  be  the  system  that  the  reader  actually 
learns.  Consider  the  "silent  e"  rule  which  makes  vowels  "long".  By  applying  this 
rule  the  reader  knows  that  code  is  pronounced  /cod/  as  opposed  to  cod ,  which  is 
pronounced  /cad/.  Yet  it  is  possible,  instead,  that  English  orthographic 
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interpretation  relies  on  some  kind  of  analogy  system  whereby  readers  internalize  the 
pronunciations  of  onsets  and  rimes:  c  as  /k/,  ode  as  /od/,  and  od  as /ad/.  If  this  is 
true,  teachers  who  faithfully  instruct  their  students  in  the  rules  of  phonics  may  be 
teaching  a  system  that  is  different  from  the  one  the  reader  actually  uses. 

A  crucial  assumption  of  the  code  philosophy  is  that  literacy  acquisition, 
unlike  first  language  acquisition,  does  not  proceed  automatically  but  requires 
explicit  instruction.  In  this  view,  first  language  acquisition  and  literacy  acquisition 
follow  basically  different  developmental  routes.  Spoken  language  has  evolved  as  a 
natural  process  of  the  human  brain.  Evidence  of  this  is  clear:  grammatical 
competence  in  spoken  language  is  naturally  acquired.  It  begins  in  infancy  and  is 
largely  completed  by  school  age.  Despite  the  fact  that  first  language  input  is 
incomplete  and  often  irregular  and  indirect,  the  pattern  and  rate  of  acquisition 
appears  to  be  universal  for  normal  children.  This  holds  despite  differences  in 
language  form,  and  a  fabric  of  social  and  cultural  factors.  Furthermore,  acquisition 
occurs  without  conscious  effort  or  awareness,  resulting  in  an  implicit  or  tacit 
knowledge  base. 

Written  language,  on  the  other  hand,  is  only  a  representation  of  natural 
language  according  to  the  code  view.  It  maps  onto  the  spoken  system  in  a  way  that 
must  be  consciously  understood  if  it  is  to  be  used  properly  (Liberman  and  Liberman 
1992).  Since  teaching  intervention  is  thought  to  be  the  critical  factor  in  learning  to 
read,  advocates  of  the  code  view  endorse  explicit  learning/teaching  methods  like 
phonics.  Again  this  view  assumes  that  such  instruction  will  make  clear  all  the 
associations  represented  in  an  orthographic  code.  Yet,  if  it  is  true  that  the  written 
language  code  is  not  strictly  phonemic,  then  even  code-trained  readers  learn 
considerably  more  than  they  are  taught.  It  would  seem  then,  that  learning  the 
orthographic  code  does  not  entirely  oblige  explicit  teaching  and  conscious  analysis. 
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It  is  possible  instead  that  learners  implicitly  induce  many  of  the  regularities  of  the 
orthographic  system. 

Strengths  and  Weakness  of  Whole  Language 

The  whole-language  philosophy  is  intuitively  appealing  and  widely 
supported  by  teachers.  Some  of  its  basic  tenets,  such  as  the  importance  of  print 
exposure  for  preschool  children,  have  influenced  educational  policy.  Indeed,  there 
has  been  shown  to  be  a  positive  correlation  between  exposure  to  print  and  reading 
success  (Cunningham  and  Stanovich  1990).  The  greater  children's  experience  with 
books  and  stories,  the  more  they  are  aware  of  the  print-language  connection. 
Whole-language  captures  the  intuition  that  readers  learn  much  more  than  they  are 
taught.    Further,  most  readers  are  not  wholly  conscious  of  what  they  know  about 
the  system.  For  example,  it  is  difficult  for  most  people  to  articulate  how  the  written 
system  operates. 

Although  whole-language  enjoys  wide  support,  it  also  has  received  some 
strong  criticism.  The  essential  (and  I  believe  well-founded)  cnticism  is  that  whole- 
language  does  not  identify  the  principles  or  mechanisms  of  learning  upon  which  it 
i«  based.  Because  this  view  assumes  that  readers  make  orthographic  connections 
spontaneously,  it  is  unconcerned  with  the  specifics  of  what  the  code  is  and  how  it 
maps  onto  natural  language.  It  does  not  address  the  similarities  or  differences 
between  spoken  and  written  language  since  it  presumes  they  are  acquired  the  same 
way.      It  does  not  address  how  children  are  able  to  learn  so  much  in  the  absence  of 
direct  instruction.  What  are  the  mechanisms  that  cause  children  to  make  the 
connections  between  print  and  speech  for  themselves?  This  view  suggests  that  they 
are  the  same  mechanisms  as  those  that  govern  first  language  acquisition,  though  it 
does  not  discuss  what  those  are  specifically.  Among  theories  of  language 
acquisition,  the  most  influential  and  widely  received  in  linguistics  and  related  fields 
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is  that,  as  a  result  of  evolutionary  adaptation,  humans  are  biologically  predisposed 
to  learn  language.  Further,  learning  is  constrained  by  basic  principles  of  language 
structure  that  are  also  biologically  f,xed.  Whole-language  advocates  do  not  even  try 
to  explain  how  the  acquisition  of  written  language  could  possibly  parallel  this  type 
of  development.  Certainly,  nobody  has  claimed  that  written  language  structures  are 
also  an  innate  component  of  the  language  faculty.  Nor  does  whole-language  posit 
an  alternative  theory  of  language  acquisition  in  which  it  would  be  possible  that  the 
spoken  and  written  forms  could  take  the  same  developmental  path.  Additionally, 
the  whole-language  philosophy  does  not  employ  reading  theories  or  empirical 
studies  of  word  recognition  and  linguistic  awareness  to  support  its  claims.  Neither 
have  advocates  of  whole-language  initiated  research  which  would  help  identify 
(within  this  framework)  what  it  is  that  some  children  learn  that  make  them  good 
readers,  and  what  other  children  miss  that  become  poor  readers. 

Summary  and  Concluding  Remarks 
Each  individual  theory  of  reading  acquisition  captures  something  important 
and  distinctive  about  the  reading  process.  Phonic  theories  are  correctly  based  on 
the  notion  that  reading  involves  phonological  processing  and  that  some  kind  of 
emergent  linguistic  awareness  is  part  of  the  learning  process.  Whole-word  theories 
rightly  address  the  issue  that  normal  readers  identify  words  much  too  fast  for  letter- 
by-letter  analysis.  Dual-route  theories  present  the  idea  that  there  may  be  more  than 
one  recognition  pathway.  This  captures  the  idea  that  word  identification  via 
phonological  receding  is  slow  and  there  must  be  an  alternative,  more  direct 
(semantic)  route  to  lexical  access.  Interactive  activation  theories  reconceptualize  the 
notion  that  both  semantic  and  phonological  information  is  used  to  identify  words. 
They  posit  not  that  there  are  separate  routes  to  word  recognition,  but  that  words 
have  both  phonological  and  semantic  representations  which  can  be  simultaneously 


32 


activated.  Further,  the  phonological  component  does  not  rely  on  conscious  rules  of 
phoneme-letter  translation.  Rather,  phonological  representations  are  an  intrinsic 
result  of  letter  pattern  recognition.  Moreover,  this  theory  can  accommodate  the 
possibility  of  other  levels  of  representation  inherent  in  the  orthographic  code. 

Likewise,  both  the  code  and  whole-language  teaching  philosophies  reflect 
important  generalizations  about  reading  acquisition.  The  code  approach,  which  is 
founded  not  only  on  practical  observation  but  a  good  deal  of  research,  rightly 
asserts  that  learning  to  read  hinges  on  deciphering  and  internalizing  the  orthographic 
system.  Whole-language  captures  the  idea,  which  is  based  on  much  practical 
observation,  that  readers  can  naturally  induce  the  principles  of  the  orthographic 
system  (whatever  they  may  be),  without  a  lot  of  explicit  instruction  or  conscious 
analysis. 

It  would  seem  that  the  code  and  whole-language  views  make  opposing 
assumptions  about  learning.  Yet,  I  believe  that  basic  insights  of  both  views  can  be 
usefully  appropriated  towards  a  theory  of  reading  acquisition.   The  fact  that  written 
language  is  a  code  for  spoken  language  is  well  supported  by  research  and  widely 
accepted  (whole-language  does  not  dispute  this).  Researchers  continue  to  grapple 
with  the  issues  of  what  orthographic  systems  represent  about  spoken  language  and 
how  humans  process  this  information.  I  believe  that  exploring  a  basic  insight  of 
whole-language  -  that  readers  can  and  do  implicitly  induce  orthographic  properties 
-  can  further  the  aim  of  discovering  the  what  and  the  how  of  orthographic 
processing. 

The  whole-language  view  endorses  a  kind  of  implicit  learning  process  for 
reading  without  bothering  to  specify  what  it  is  or  how  it  works.   This  is  due  partly 
to  the  fact  that  the  implicit  learning  mechanisms  involved  are  not  well  understood 
and  are  outside  the  scope  of  traditional  education.  Yet,  within  other  disciplines 
such  as  psychology  and  computer  science,  there  is  abundant  evidence  that  implicit 
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learning  processes  are  efficient,  principled  cognitive  functions  that  are  very  good  at 
processing  certain  kinds  of  information.  I  suggest  that  decoding  orthographic 
representation  is  one  of  these  types.  Connecting  the  principles  of  implicit  learning 
with  the  facts  that  are  known  about  orthographic  systems  and  processing,  can  only 
further  the  discussion. 


CHAPTER 3 
MULTILEVELED  REPRESENTATION:  ORTHOGRAPHIC  EVIDENCE 


Understanding  the  process  of  reading  acquisition  relies  on  an  understanding 
of  what  an  orthographic  system  is.    Many  theories  of  reading  acquisition  are 
premised  on  the  idea  that  processing  visual  language  is  influenced  by  what  is 
represented  in  the  orthography.  Reading  research  has  relied  on  the  assumption  that 
the  representational  relationship  between  speech  and  writing  is  straightforward. 
For  example,  the  interest  in  alphabetic  scripts,  which  has  dominated  reading 
research,  has  long  presumed  that  alphabets  represent  phonemic  systems.  Attesting 
to  the  currency  of  this  assumption  is  an  enormous  body  of  literature  which  explores 
the  relationship  between  letters  and  words  and  their  connection  to  phonemes  and 
words.  Despite  the  fact  that  English  script  is  known  to  be  "irregular"  in  its 
representation,  and  some  have  pointed  to  orthographic  representation  of 
morphology  (Chomsky  and  Halle  1968;  Klima  1972;  Read  1983),  analyses  of 
English  orthography  have  been  principally  concerned  with  describing  letter-to- 
phoneme  mappings  and  the  rules  that  govern  them. 

The  idea  that  the  alphabetic  systems  may  encode  other  than  phonemic 
linguistic  information  is  relevant  to  improving  our  understanding  of  how  visual 
language  is  processed.  Yet,  the  continuing  interest  in  how  the  phonemic  aspects  of 
the  system  are  learned  has  overshadowed  such  concerns  and,  with  a  few  notable 
exceptions,  this  issue  has  remained  in  the  background  of  the  literature  on  reading 
acquisition.  I  believe  that  the  study  of  writing  systems,  which  specifically  concerns 
itself  with  what  scripts  represent,  can  inform  reading  research  on  this  issue.  This  is 
the  case  not  only  because  reading  research  should  look  more  closely  at 
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representational  relationships  within  particular  scripts,  but  also  because  analyses  of 
the  world's  scripts  reveal  striking  commonalities  among  seemingly  diverse 
systems.  The  idea  that  various  scripts  are  very  much  alike  in  regard  to  the  manner 
in  which  linguistic  information  is  presented  has  important  implications  for  language 
processing  and  can  significantly  advance  reading  research. 

Two  relevant  observations  emerge  from  a  survey  of  the  world  scripts.  The 
first  is  that  orthographies  are  usually  only  partially  representative  of  any  particular 
level  of  linguistic  knowledge.  (Chomsky  and  Halle  1968;  Chomsky  1970; 
Coulmas  1989;  Miller  1994;  Daniels  1992, 1996).  Even  scripts  that  are  thought  to 
be  very  consistent  omit  considerable  information,  making  them  incomplete 
renderings  of  a  linguistic  system.    For  example,  phonological  scripts  do  not  depict 
all  the  phonological  information  present  in  the  spoken  language.  Segmental  scripts 
may  only  partly  represent  phonemic  systems.  Korean,  a  segmental-syllabic  script, 
can  unambiguously  represent  all  the  segments  of  the  language  but  in  some  cases  it 
surrenders  segmental  uniformity  in  favor  of  morphological  consistency.  Many 
phonological  scripts  omit  phonemic  stress  (English,  Russian).  Some  phonological 
scripts  represent  phonemic  tone  (Vietnamese,  Burmese,  Thai)  but  others  do  not 
(Serbo-Croation,  Slovene,  Lithuanian).  Abjads  leave  out  significant  segmental 
information  (vowels)  based  on  morphological  predictability.  No  phonological 
scripts  give  narrow  phonetic  transcriptions  of  their  languages.    Likewise, 
logographic  scripts  supply  incomplete  morphological  information.  This  is  the  case 
with  Chinese,  where  the  morphological  component  of  the  characters  sometimes 
give  substantial  semantic  information  and  sometimes  only  a  vague  hint  (Coulmas 
1989).  Similarly,  Sumerian  cuneiform,  in  its  early  stages  represented  lexical  but 
not  grammatical  morphemes.  (Cooper  1996). 

The  second  observation  is  that  orthographies  usually  represent  aspects  of 
more  than  one  "level"  simultaneously.  That  is,  scripts  are  likely  to  (partially) 
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represent  some  combination  of  phonemes,  syllables,  morphemes,  and  words  all  at 
once.  (Gleitman  and  Rozin  1977;  Henderson  1982;  Sampson  1985;  Commas 
1989;  Daniels  1992,  1996;  Miller  1994).  Evidence  of  this  is  widespread  across 
scripts.  In  this  chapter,  I  will  discuss  how  various  levels  of  linguistic  information 
(segmental,  syllabic,  morphological,  lexical)  are  included  in  various  scripts,  giving 
particular  attention  to  how  alphabetic  scripts  can  represent  information  beyond  the 
phoneme.  What  emerges  from  this  assessment  is  that  apparently  dissimilar  scripts 
are  very  much  alike  with  respect  to  the  ways  in  which  they  depict  linguistic 
information.  That  is,  orthographies  are  likely  to  partially  represent  several  aspects 
of  linguistic  structure  at  the  same  time    Moreover,  this  discussion  prefaces  the  next 
chapter  which  reviews  existing  research  that  indicates  that  people  do  use  the  various 
linguistic  structures  encoded  in  the  orthography  in  order  to  process  visual  language. 
A  Survey  of  Script  Types 
The  level  of  linguistic  structure  most  widely  represented  among  scripts  is 
the  syllable.  In  addition  to  syllabaries  which  overtly  represent  syllables,  both 
alphabets  and  logographies  include  syllabic  information  to  varying  degrees9.    As 
Daniels  (1992)  argues,  the  preference  for  syllabic  representation  is  likely  a  result  of 
the  perceptual  salience  of  the  syllabic  units  of  speech. 

The  distinguishing  characteristic  of  a  syllabary  is  that  characters  denote 
particular  syllables  without  reference  to  the  phonemes  that  comprise  them.  Usually, 
syllabaries  represent  CV  syllables,  having  been  adapted  for  languages  with  fairly 
simple  canonical  structures.  Syllabary  characters  almost  never  (with  the  exception 
of  Akkadian  (Daniels  1992))  represent  VC  syllables'". 


'Typological  terminology  for  various  scripts  is  taken  from  Daniels  1996. 

Interestingly,  syllabaries  represent  CV's  even  in  languages  that  have  codas  and  long  vowels 
For  languages  that  allow  coda's  and  long  vowels,  a  CV  syllabary  usually  some  convention  for 
dealing  with  them-separate  characters  or  diacritics.  Hence,  syllabaries  may  he  morale  in  the  sense 
that  they  distinguish  the  peak  with  post  peak  elements.  However  they  are  still  syllabic  in  the 
sense  that  the  elements  of  the  syllable  are  combined  to  form  syllable  units  and  syllable  units  are 
spaually  separated  from  each  other.  In  addition,  there  may  be  perceptual  reasons  that  syllabaries 
always  represent  CV's.  Generally  the  auditory  cues  present  in  CV's  are  more  robust  than  VCs 
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Examples  of  a  syllabary  are  employed  in  Japanese  writing:  hiragana  and 
katakana.    The  kana  are  complete  scripts  in  that  they  represent  all  the  syllables  in 
the  language  (Faber  1992)  and  in  principle,  anything  in  the  language  could  be 
expressed  solely  via  the  syllabaries.     However,  Japanese  is  typically  written  in  a 
mixture  of  kana  and  kanji,  a  logographic  script,  mainly  of  Chinese  origin.   The 
kana  syllabaries  represent  CV  and  CyV  combinations.  The  hiragana,  which 
consists  of  46  characters  supplemented  by  diacritics,  is  used  for  grammatical 
elements:  particles,  auxiliary  verbs,  and  the  inflectional  affixes.   The  katakana  is 
used  for  foreign  names,  loan  words,  onomatopoeic  and  mimetic  words, 
exclamations  and  some  scientific  terminology.  The  kanji  are  used  mainly  to  encode 
primary  lexical  categories.  Additionally,  the  kanji  characters  independently 
represent  various  types  of  information.  Each  kanji  character  may  have  different 
pronunciations.  The  on-readings  are  based  on  the  original  Chinese  pronunciations; 
the  Awn-readings  correspond  to  a  Japanese  morpheme  that  corresponds  to  the 
meaning  of  the  character.  The  same  character  may  stand  for  several  different 
morphemes,  each  with  its  specific  meaning  and  reading.  Also,  when  the  characters 
stand  for  a  Chinese  morpheme  (on-reading),  the  phonetic  component  of  the 
character  sometimes  give  clues  to  the  pronunciation  The  mixture  of  these  three 
scripts  in  Japanese  writing  is  an  obvious  example  of  a  mixed  phonological  and 
morphological  system.  (Shibamoto  Smith  1996;  Morton  and  Sasanuma  1984). 

While  "pure"  syllabaries  do  not  encode  phonemes,  some  implicitly  encode 
some  segmental  information.  The  Mycenean  (Linear  B)  script  is  an  example  of  a 
syllabary  that  represents  syllables  without  explicitly  indicating  the  segments  that 
comprise  them.  However,  some  segmental  information  is  implicitly  indicated. 
Characters  in  the  Mycenean  script  represent  V,  CV  and  sometime  CCV.  Except 


(Oula  and  Kawasaki  1984).  There  is  a  more  reliable  set  of  place  cues  in  the  CV  transition  than  in 
the  VC  transition.  Therefore,  listeners  weight  the  former  more  heavily  in  deciding  what  they  have 
heard  (Ohala  1990). 
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vowel  initial  syllables,  Mycenean  graphs  do  not  indicate  individual  segments  and 
coda  consonants  are  generally  ignored.  However,  onset  clusters  are  indicated  by 
the  graphs  for  the  appropriate  consonants  plus  the  nuclear  vowel  of  the  syllable. 
Hence,  there  is  a  dummy  copy  of  the  peak  vowel.  For  example,  the  onset  cluster  in 
the  syllable  [tri]  is  indicated  by  the  graphs  for  the  syllables  ti  +  ri .  Further,  coda 
clusters  that  are  possible  onsets  (according  to  the  sonority  scale)  are  indicated  by 
writing  the  sign  for  the  first  segment  of  the  cluster.  In  these  cases,  according  to 
graphic  conventions,  the  very  final  coda  consonant  is  not  represented.  Miller 
(1994)  argues  that  these  facts  indicate  that  the  Mycenaean  system  recognizes 
segments  as  isolable  units,  and  also  that  using  it  requires  implicit  knowledge  of  the 
sonority  hierarchy. 

Vai  script,  an  indigenous  writing  system  of  the  Vai  language  of  west  Africa, 
is  a  syllable/moraic  representation.    Mora  is  a  prosodic  unit,  a  formalization  of 
phonological  weight,  intervening  between  the  phonemic  string  and  the  syllable.  In 
Vai  script,  the  weight  of  the  syllable  determines  the  number  of  characters  that 
represent  it.  The  characters  represent  CVs.  Light  syllables  are  indicated  by  one 
CV  character.  Heavy  (two  mora)  syllables  are  indicated  with  two  characters.  The 
only  closed  syllables  are  those  with  velar  nasals.  This  nasal  coda  is  represented 
with  a  separate  character.  The  same  nasal  character  may  be  syllabic  in  onset 
position.  Also,  long  vowels  and  diphthongs  (two  mora  peaks)  are  written  with  two 
characters.  The  spelling  of  syllables  containing  long  vowels  and  diphthongs  are 
variable.  For  example  [taa]  is  usually  written  with  the  graphs  for  ta.ha,  not  ta.a  . 
Similarly,  [lakoa]  is  written  la.ko.wa.  The  graph  for  the  syllable  [a]  is  reserved  for 
words  that  begin  with  [a].  However,  the  graph  for  the  syllable  [i]  can  occur  both 
word  initially  and  elsewhere".  Syllables  with  onset  clusters  are  written  with  a 
single  character  that  represent  that  particular  cluster  plus  vowel.  This  is  consistent 

1 1  These  conventions  could  contribute  to  word  identification  since  Vai  does  not  explicidy  signal 
word  boundaries  -  there  are  no  spaces  between  words  or  other  graphic  indicators. 
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with  the  convention  that  segmental  constituents  of  the  syllable  are  individually 

indicated  only  when  they  contribute  to  syllable  weight.  Vai  has  lexical  tone  which 

is  not  represented  in  the  script,  and  there  is  no  division  between  words,  though 

distributional  properties  among  the  characters  make  these  omissions  less  confusing 

then  they  might  be.  There  is  a  character  to  signal  the  end  of  a  sentence  (Singler 

1996).  Hence,  Vai  script  represents  a  variety  of  phonological  information: 

syllables,  syllable  weight,  coda  segments.  Yet,  it  omits  considerable  information  as 

well,  for  example,  it  does  not  indicate  individual  segments  within  an  onset  cluster, 

phonemic  tone  or  word  boundaries. 

A  neosyllabary'2  represents  syllables  and  their  constituent  segments.  In  a 

neosyllabary,  each  character  indicates  a  consonant  plus  a  specific  vowel.  Other 

vowels  are  denoted  by  a  consistent  modification  of  the  consonant  symbols.  The 

Indie  scripts  (Devangari,  Punjabi,  Bengali  and  others)  and  several  Southeast  Asian 

scripts  (Burmese,  Lao,  Tai,  Khmer)  are  examples  of  this.  Brahmi,  the  ancestor  to 

many  modern  scripts  of  this  type,  is  based  on  open  syllables:  V,  CV,  CCV,  etc.    V 

syllables  are  indicated  by  independent  signs.  CV  syllables  are  represented  by  the 

consonant  plus  a  diacritic  sign  for  the  vowel  which  is  attached  to  the  consonant. 

Consonant  characters  without  any  diacritic  vowel  are  understood  to  represent  the 

consonant  plus  "inherent"  default  vowel  /a/.  A  syllable  consisting  of  a  consonant 

cluster  is  denoted  by  consonants  that  are  joined  in  a  conjunct  character,  indicating 

the  cancellation  of  the  inherent  vowel  (Salomon  1996).  Note  that  this  is  different 

from  a  regular  syllabary,  which  has  no  formal  mechanism  for  denoting  the 

segmental  constituents  of  the  syllable.  Neosyllabaries  may  have  conventions  for 

distinguishing  syllabic  constituents.  For  example,  coda  consonants  may  be 

distingusished  from  onset  consonants,  as  in  Tibetan  (van  der  Kuijp  1996)  and 

Burmese  (Wheatley  1996),  and  many  neosyllabaries  distinguish  long  and  short 

12  This  term  which  I  have  taken  from  Daniels  1992,  is  laken  from  Fevrier  (1948).  The  Ethiopic 
name  for  (his,  abugida,  is  also  used. 
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vowels  (e.g.,  Brahmi).  Further,  a  neosyllabary  may  have  conventions  for 
representing  tone.  Tone  may  be  marked  by  additional  signs  as  in  Thai  (Diller 
1996),  or,  as  with  the  Punjabi  script  Gurmukhi,  tone  is  indicated  by  the  consonants 
and  vowel  signs  (Gill  1996).  Also,  a  neosyllabary  may  indicate  word  boundaries 
or  phrase  boundaries  (e.g.,  Malayalam  (Mohanan  1996)),  or  it  may  not  (e.g.,  Lao 
and  Thai  (Diller  1996)). 

Korean  script,  the  Hankul,  (sometimes  called  an  alpha-syllabary)  encodes 
syllabic,  segmental  and  featural  information.  This  orthography  has  a  sign  for  each 
phoneme  in  the  language  but  it  is  written  in  syllable  blocks.  Any  syllable  must 
begin  with  a  consonant  sign.  To  form  the  peak  of  the  syllable  block,  the  vowel 
sign  much  attach  to  the  side  or  beneath  the  consonant,  following  the  principles  of 
stroke  order.  When  the  syllable  begins  with  a  vowel,  the  written  syllable  begins 
with  the  zero  sign,  representing  a  zero  consonant.  The  zero  symbol  also  represents 
syllable  final  /n/,  which  does  not  occur  syllable  initially.  Consonant  clusters  are 
denoted  by  the  joining  of  consonant  symbols.  The  hankul  is  called  a  featural 
system  because  the  shape  of  the  basic  consonants  is  a  graphic  representation  of  the 
places  of  articulation.  For  example,  the  sign  for  /k/  is  7 ,  which  denotes  the  tongue 
position  for  velar  articulation.  By  doubling  or  adding  strokes  to  the  five  basic 
consonant  shapes,  one  derives  the  shapes  for  the  aspirates,  tense  unaspirates, 
affricates  and  so  on.  Yet,  despite  its  segmental  accuracy,  modern  Hankul  does  not 
consistently  represent  phonemes  in  the  context  of  most  morphophonemic 
alternations  in  favor  of  keeping  the  morphemes  consistently  represented.  This  is 
true  for  nouns  and  verbs  but  is  particularly  the  case  for  nouns  (King  1996). 
Consequently,  the  Hankul  represents  a  variety  of  linguistic  information,  from 
features  to  morphemes. 

Another  type  of  syllabic  script  is  the  Cree  syllabary,  a  syllabic  "shorthand" 
created  for  two  Algonquian  languages  of  Canada,  Cree  and  Ojibwe  and  adapted  for 
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Athabaskan  and  Inuit  (Eskimo)  languages.  It  is  a  shorthand- based  script  employing 
geometric  characters,  some  representing  syllables,  some  segments.   There  is  no 
standardized  spelling  for  any  of  the  dialects  using  it  and  there  are  local  conventions 
for  character  inventory,  shape,  and  writing  conventions.  However,  some  of  its 
characteristic  features  are  the  following:  vowel  initial  syllables  are  written  with  a 
triangle  syllabic,  rotated  through  four  positions  to  show  vowel  quality.  Most 
consonant-initial  syllables  are  written  with  syllables  in  which  the  shape  shows  the 
consonant  and  the  orientation  shows  the  vowel.    Syllable-final  consonants  are 
written  with  small  alphabetic  characters  cal\ed  finals,  usually  subscripted.  The 
initial  C  of  a  consonant  cluster  can  be  written  with  a  final.  Vowel  length  may  be 
marked  with  a  dot  over  a  syllabic  (Nichols  1996).   Interestingly,  the  amount  and 
specificity  of  information  presented  in  this  script  will  vary  by  individual. 
Following  the  shorthand  origin  of  the  script,  writers  may  use  plain  syllables, 
indicating  basic  syllable  structure  or  pointed  syllables,  adding  diacritics  to  include 
segmental  information.  With  enough  diacritics,  a  writer  may  give  an  exact 
phonemic  transcription,  though  this  is  rare. 

The  Pahawh  Hmong  script  of  Southeast  Asia  (Laos)  is  a  phonological 
system  which  exhaustively  represents  the  syllabic  constituents,  onsets  and  rimes,  in 
the  language.  There  are  many  possible  consonantal  onsets  but  only  one  possible 
coda  ltd.  Each  graph  represents  an  onset  (C  or  C  cluster),  or  a  rime  (vowel,  coda 
In/,  and  tone,  which  is  represented  by  diacritics ).   The  graphs  are  written  rime- 
onset,  opposite  of  the  way  they  are  pronounced,  indicating  the  perceived  primacy  of 
the  rime  (V  plus  tone)  over  onset  consonants.  The  one  coda  consonant  is  always 
part  of  the  rime  graph,  the  symbol  of  onset  ltd  is  never  used  in  coda  position. 
Hmong  is  an  isolating  language,  consisting  of  mainly  monosyllabic  morphemes 
which  are  usually  represented  by  graph  pairs.  Each  pair  of  rime-onset/morpheme 
graphs  are  set  apart  by  spaces  and  are  written  left  to  right  across  the  page  (Ratliff 
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1996).  Hence,  Hmong  orthography  represents  syllables,  syllabic  constituents  and 
morphemes  at  once. 

In  logosyllabic  scripts,  characters  represent  words  or  morphemes  as  well  as 
specific  syllables.  This  type  of  script  is  likely  to  represent  a  language  where 
morphemes  comprise  a  single  syllable.   The  Pahawh  Hmong  Script  described 
above  is  one  kind  of  logosyllabic  orthography.  Sumerian  cuneiform  is  a  another 
variant  of  such  a  script.  In  Sumerian,  unbound  morphemes  are  represented  by 
logograms,  bound  morphemes  are  represented  by  rebus-derived  syllabic  signs, 
usually  V,  CV  or  VC  (Cooper  1996).  Likewise,  in  Maya  writing,  which  has  only 
partially  been  deciphered,  logograms  appear  in  combination  with  CV  syllable  signs 
(Macri  1996). 

Chinese,  which  has  been  traditionally  classified  as  a  logographic  script,  is 
another  type  of  logosyllabic  writing.  The  most  numerous  type  of  character  in 
modern  Chinese  script  is  that  which  contains  a  phonological  and  morphemic 
component  (Coulmas  1989).  The  phonetic  element  in  the  character  represents  a 
particular  syllable,  some  features  of  the  syllable,  or  the  syllable  and  accompanying 
tone.  The  classifier  or  radical  gives  semantic  clues  for  that  particular  morpheme 
Both  the  phonetic  and  the  classifier  offer  partial  cues  for  word  identification 
although  the  relative  importance  of  either  element  contributing  to  word  identification 
varies.    For  instance,  some  phonetics  occur  in  many  characters,  some  in  only  a 
few.  There  are  many  homophonous  forms  but  more  than  one  phonetic  element  to 
represent  the  various  forms.  For  example,  the  character  for  /tang/  uses  the  same 
phonetic  component  in  4  out  of  6  homophonous  forms;  the  other  2  use  a  different 
phonetic.  Similarly,  the  amount  of  information  conveyed  by  the  classifier  varies 
widely.  In  some  characters  the  classifier  is  clear  while  in  others  it  is  vague, 
providing  only  a  clue.  In  still  others,  it  merely  differentiates  similar  characters.   As 
a  result,  classifiers  and  phonetics  serve  mutually  diacritic  functions.  That  is,  in 
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some  cases  the  phonetic  is  clear  and  the  classifier  is  vague  so,  the  major  recognition 
comes  from  the  phonetic  element  and  the  classifier  serves  only  to  disambiguate 
forms.    In  other  cases  most  of  information  conveyed  by  character  is  semantic  while 
the  phonetic  serves  distinguish  similar  forms  (Coulmas  1989). 

In  an  Abjad ,  characters  represent  consonant  segments  only.  This  type  of 
script  was  originally  adapted  for  Semitic  language  structure  (e.g.,  Arabic,  Hebrew). 
The  Semitic  scripts  convey  (partially)  both  phonological  and  conceptual-semantic 
information.  Word  "roots,"  which  are  represented  with  a  skeleton  of  consonants, 
convey  general  semantic  information.  A  specific  meaning  is  established  by 
mounting  a  particular  "word  pattern"  onto  the  root.  The  word  patterns  are  vowel- 
melody  morphemes,  which  are  not  usually  represented  but  are  predictable  by 
context.  For  example,  the  root,  represented  by  the  letters  for  /k  s  r/,  refers  to  the 
concept  of  connection,  in  Hebrew.    By  inserting  the  word  pattern  [-e  -e]  on  this 
root,  the  resulting  word  is  /keser/  means  knot  or  connection  or  conspiracy.    The 
vowel -morphemes  may  be  indicated  by  diacritics  but  usually  are  not.  Words  are  set 
apart  by  spaces,  providing  some  lexical  cues.  (Bentin  and  Frost  1995;  Bauer  1996). 
Hence,  pronunciation  of  written  Arabic  and  Hebrew  relies  on  knowing  the 
languages'  morphological  patterns.  Conceptual-semantic  information  along  with 
some  phonological  and  lexical  information  are  represented  by  the  script. 
Morphological  information  is  usually  supplied  by  the  reader  so  that  the  meaning  of 
words  can  be  inferred. 

Alphabets 

Most  alphabetic  scripts  predominantly  represent  segmental  information. 
Yet,  in  addition  to  segmental  information,  an  alphabetic  script  may  represent 
syllabic,  morphophonemic  and  morphological  and  lexical  information  with  varying 
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degrees  of  explicitness.  Segments  are  most  explicitly  represented  by  individual 
letters.  Much  information,  however,  is  indicated  by  letter  patterns. 

Orthographic  regularities.  A  shared  feature  of  alphabetic  systems  is  that 
orthographic regularities  -  the  predictability  of  letter  patterns  within  a  word  --will 
indicate  recurring  structures  larger  than  the  phoneme.  For  example,  the  distribution 
of  letter  patterns  within  words  can  signal  syllable  boundaries  in  multisyllabic 
words.    Patterns  of  C's  and  V's  will  be  reflected  in  written  words,  creating 
inhomogeneous  frequencies  of  different  letter  sequences.   The  letter  distribution 
across  syllable  boundaries  is  often  distinct  from  letter  patterns  that  occur  within 
syllables.  Because  there  are  more  constraints  on  phonemes  that  can  occur  within 
syllables  than  across  them  (Seidenberg  1987,1989),  the  distributional  properties  of 
letter  patterns  insure  that  syllable  boundaries  are  often  marked  by  particular  letter 
tokens  (Adams  1981  in  Seidenberg  1987).  For  example,  syllable  boundaries  are 
often  flanked  by  letter  patterns  with  relatively  low  frequencies-  an  vil  or  vodka. 
The  syllable  boundary  bisects  the  lowest  frequency  bigram  in  a  word  (Seidenberg 
1987).  By  exploiting  distributional  facts,  information  about  syllable  boundaries  can 
be  extracted. 

In  principle,  the  above  is  true  for  all  sublexical  units  identified  by  recurring 
letter  sequences.  Hence,  an  alphabetic  orthography  will  also  tend  to  reflect  the 
morphological  composition  of  words.  Like  letter  patterns  across  syllables,  the 
distribution  of  letter  patterns  across  morpheme  boundaries  will  differ  from  those 
within  morphemes.  For  instance,  root  plus  affix  morpheme  boundaries  will  be 
indicated  by  the  frequency  of  particular  letter  patterns.  That  is,  the  letter  pattern  in 
the  word  pressing  reflects  its  composite  morphemes  as  press  +  ing  instead  of  pre 
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+  ssing  since  there  is  no  morpheme  -ssing..    Also,  the  sequence  -ing   occurs 
frequently  at  morpheme  boundaries. " 

Linguistic  Structures.  It  is  true  that  letter  pattern  frequencies  can  identify 
linguistic  information  other  than  phonemic  that  is  present  in  alphabetic  scripts. 
Though  all  alphabetic  scripts  share  this  basic  property,  particular  alphabetic  scripts 
will  have  various  conventions  for  more  explicitly  representing  different  types  of 
linguistic  information.  Further,  alphabetic  scripts  will  vary  with  respect  to  the 
degree  of  segmental,  syllabic  or  morphological  information  they  include.  For 
example,  some  alphabets  include  much  straightforward  phonemic  information, 
some  very  little.  English  is  a  good  example  of  a  script  that  has  various  conventions 
(using  letter  patterns)  for  indicating  several  different  types  of  linguistic  information. 

It  has  often  been  noted  that  English  orthography  is  not  straightfowardly 
phonemic.  Some  English  consonants  are  represented  by  a  single  letter.  For 
instance,  I  pi  is  usually  indicated  by  p,  /b/  by  b,  and  so  on.  Yet  even  just  for 
consonant-  segmental  level,  the  orthography  quickly  becomes  less  phonemically 
regular.  English  script  often  uses  different  letters  to  represent  the  same  sound:  Ikl 
is  represented  by  *,  c,  ck,  x,  and  qu,  Id  is  indicated  by/  and  ph,  1)1  by  j,  g,  dg,  lit 
by  l,  s,  and  x  ,  hi  by  d  and  t.  etc.    Conversely,  many  different  sounds  are 
represented  by  the  same  letter  or  letter  sequence:  c  represents  Ikl,  Isl,  1st,  s 
represents  hi,  Isl,  Iff  and  let  and  so  on.  Pronunciation  of  letters  that  represent 
several  sounds  are  often  predictable  by  context.  That  is,  surrounding  letter  patterns 


The  fact  that  morphological  and  syllabic  parsing  will  often  segment  different  sublexical  units 
has  been  of  issue  to  psycholinguists  for  some  time  (Coltheart  1978;  Taft  1979)  For  example  pre- 
ssing is  syllabic  parsing  while,  press-mg  is  morphological.  The  difficulty  has  resided  with     ' 
determining  which  linguistic  structures  the  reader  is  likely  to  parse.    As  it  is  clear  that  there  are 
various  types  of  linguistic  information  encoded  in  scripts,  my  suggestion  is  that  readers  extract  the 
linguistic  cues  present  in  the  visual  signal  simultaneously,  and  further  this  does  not  present  a 
conflict  for  processing,  but  an  advantage. 
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cue  the  reader  as  to  the  correct  pronunciation.  For  instance,  word  final  s  represents 
ItJ  after  voiced  segments  and  Is!  after  voiceless. 

Yet  it  is  really  English  vowels  which  make  clear  that  the  orthographic 
representation  cannot  be  strictly  phonemic.   Since  there  are  only  six  letters  to 
represent  fourteen  vowels,  a  system  of  rules  which  could  account  for  vowel-to- 
letter  correspondences  would  be  very  complex  with  many  exceptions. 
Alternatively,  as  Treiman  (1992,  1993)  points  out,  while  one  can  argue  that  vowels 
are  represented  very  inconsistently,  if  you  consider  the  sequences  of  graphemes 
permitted  to  compose  an  entire  rime,  the  orthographic  system  suddenly  becomes 
exceedingly  systematic.  She  suggests  that  English  orthography  represents  syllabic 
constituents:  onsets  and  rimes.   That  is,  onsets  and  rimes  are  indicated  in  the 
orthography  by  a  high  probability  of  sharing  certain  groups  letters.    By  way  of 
illustration,  consider  the  representation  of  vowels  in  English.  Instead  of  a  single 
letter  indicating  the  vowel,  in  many  cases  it  is  the  letter  sequence  composing  the 
entire  rime  that  informs  the  reader  of  the  vowel.  Conversely,  the  consonant(s) 
representing  the  onset  never  give  information  about  the  vowel.  For  example,  for 
any  particular  CVC  syllable  (e.g.,  /bade/,  spelled  back  or  /bek/,  spelled  bake ),  the 
initial  consonant-letter  does  not  give  any  information  about  the  pronunciation  of  the 
vowel.   However,  the  letters  that  denote  the  rest  of  the  rime  do  inform  the  reader 
about  the  vowel.  Consider  the  letter  a  in  monosyllabic  words.  In  a  phoneme-to- 
letter  mapping  system,  the  letter  a  is  thought  to  denote  several  different  vowels  as 
in:  blare,  blank,  blast,  blah,  blame,  ball.   In  all  these  examples,  the  consonant(s) 
that  compose  the  onset  do  not  give  the  reader  any  clue  which  vowel  the  letter  a 
represents.  However,  the  letters  that  compose  the  entire  rime  (e.g.,  -ame,  -all )  are 
crucial  for  recognition  of  the  vowel/rime.  This  pattern  of  entire  rimes  being 
consistently  represented  by  particular  grapheme  sequences  is  the  norm  in 
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monosyllabic  English  words".  Consequently,  English  orthography  marks  onsets 
and  rimes  as  distinct  units.  So,  instead  of  sound-spelling  rules  based  on  individual 
phonemes,  many  phonological-graphic  correspondences  are  based  on  larger 
syllable  constituents,  onsets  and  rimes. 

The  representational  system  for  vowels  in  multisyllabic  words  is  often 
vague.  For  instance,  consider  the  following  pronunciations  of  the  letter  a : 
syllable  /sllabal/,  syllabic,  /sslaeblk/,  syllabification,  /salsbafakesln/.    Here,  a 
represents  M,  Ixl  and  lei.  Particularly  for  multisyllabic,  morphologically  complex 
words,  vowel  pronunciation  is  largely  indicated  by  lexical  stress  which  is  not 
represented  in  English  script.  This  means  word  recognition  depends  largely  on  the 
reader  recognizing  the  morphemes  or  words  in  question.    Since  English 
orthography  tends  to  keep  morphemes  consistent,  (e.g.,  syllable,  syllabic, 
syllabification),  vowel  pronunciation  in  multisyllabic  words  is  determined  largely 
via  consistent  morphological  representation,  and  less  by  phonological  cues. 

Another  feature  of  English  orthography  that  contributes  to  vowel 
identification  is  the  contrast  between  single  and  double  consonants.  In  many  cases, 
double  consonants  do  not  contribute  consonantal  information,  however,  they  do 
give  information  about  the  preceding  vowel.  This  is  true  for  many  bisyllabic 
words.  The  double  consonants  in:  apple,  babble  distinguish  the  preceding  vowel 
/a?/  as  opposed  to  lei  in:  staple,  table.  The  same  is  true  with 

litter /I/ vs.  litre  III 
toddle /a/ vs.  total  lot 
setter  Id  vs.  cedar  HI 
syllable  III  vs.  stylish  /nil 
muffle  IN  vs.  scruple /u/ 

Similarly,  English  orthography  uses  the  double  vs.  single  consonant  contrast  to 
distinguish  vowels  in  some  words  that  are  made  multisyllabic  by  affixation.    When 
14  1  will  discuss  processing  evidence  for  this  structural  division  in  the  next  chapter. 
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the  one  syllable,  V  initial  suffixes  ( -ing,  -ed,  -er,  -y)  are  added  to  monosyllabic 
words  and  some  bisyllabic  (with  syllable  final  stress)  words  with  the  vowels  Is/, 
III,  Id,  /a/,  Ml  and  IN,  the  final  consonant  is  doubled  in  order  to  indicate  that  the 
preceding  vowel  remains  the  same.  For  instance, 

/aV  in  hatter   vs.  Id  in  liater 
IV  in  pinned  vs.  /ai/  in  pined 
1 1/  in  admitting  vs./  ai/  in  admiring. 
Id  in  hopping  vs.  hi  in  ho  ping 
Ml  in  inputting  vs.  Id  in  computing 
IN  in  drumming  vs  Id  in  consuming 

Though  this  fact  about  the  orthography  is  usually  explained  in  terms  of  a  spelling 
rule,  it  is  neither  arbitrary,  nor  applicable  only  to  root-plus-  affix  environments. 
The  double  consonants  are  a  feature  the  orthography  uses  for  distinguishing  among 
vowels'5,  which  applies  both  word  internally  and  across  morpheme  boundaries. 
Hence,  English  script  uses  letter  patterns  to  indicate  segmental  information. 

A  strictly  phonological  orthography  would  represent  the  words  as  they  are 
spoken,  regardless  of  phonological  alternations  that  are  a  result  of  morphological 
processes.  Yet  some  alphabetic  scripts  consistently  indicate  morphological 
structures  in  favor  of  phonological  ones.  For  instance,  English,  Czech  (Henderson 
1984),  Albanian  (Comrie  1996),  Ol  Cemet  (Zide  1996),  Korean'*  (King  1996)  are 
partly  morphophonemic.  That  is,  in  morphophonemic  environments,  they  often 
ignore  phonemic  contrasts  in  favor  of  consistently  representing  morphemes. 

As  previously  noted,  with  English  script  there  is  a  tendency  toward 
morphological  consistency.  This  is  true  for  both  inflectional  and  derivational 

15  In  some  cases  double  consonants  can  indicate  consonantal  rather  than  vowel  information   In 
cases  where  affixation  results  in  geminate  consonants,  double  consonants  indicate  consonant 
length.  For  example,  in  openness  /oplnnes/  and  unnatural  /Anmetr  1/,  double  consonants  are 
not  giving  vowel  information  but  indicating  consonant  length.  Also,  these  spellings  keep  the 
respective  morphemes  (e.g. ,  open  and  -ness  )  consistently  represented. 

16  an  alphabetic-syllabic  script 
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processes,  regardless  of  phonological  alternations.  For  instance,  inflectional 
affixes  may  be  variably  pronounced  while  being  graphically  consistent.  The  past 
morpheme  represented -erf  indicates  IM  in  tripped ,  Id/  in  scarred,  and  /Id/  in  parted 
.  The  plural  -s  indicates  IzJ  in  dogs  and  /s/  in  cats.  The  plural  -es  indicates  /Iz/. 
Where  different  inflectional  markers  are  phonologically  similar,  they  are 
distinguished  graphically.  For  instance,  the  -s  plural  and  the  -'s  possessive 
exhibit  the  same  phonological  alternations.  Also  -ing  is  frequently  reduced  in 
speech,  but  is  graphically  consistent. 

Likewise,  English  spelling  consistently  represents  many  derivational 
morphemes  regardless  of  predictable  phonological  alternations. 

attractlattract-ion 
infect  I  infect  -ious 
right/right -eous 

critic  I  critic  -allcritic  -ize 

plastic  I  plastic  -ize 

medic  -ine  /medic  -  in-allmedic  -ate/  medic  -al 

anxi-ouslanxi-ety 

Vowel  shifting  with  affixation  is  usually  not  represented  in  spelling.,  keeping  the 

base  morphemes  consistent. 

syllable/ syllab-ify 
declare/ declar-ation 
inspire/ inspir-ation 
ado  re/ado  r-ation 
sane/san-ity 
revise/revis-ion 
couragelcourag-eous 
illustrate!  illiistrat-ive 
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However,  in  some  cases,  morphophonemic  alternations  may  be  represented  in  the 
spelling. 

persuadelpersuas  ion 
decide/decision 
vain/vanity 
analysis/analyze 

Alternatively,  some  alphabetic  scripts  such  as  Italian,  Serbo-Croatian 
(Henderson  1984),  Turkish  (Comrie  1996)  favor  phonological  consistency.  Such 
scripts  are  quite  phonemically  consistent  and  do  represent  morphophonemic 
changes  in  pronunciation  in  the  spelling.    As  Henderson  (1984)  points  out, 
languages  in  which  morphological  processes  lead  to  a  lot  of  variation  in  the  surface 
forms  tend  to  favor  phonological  regularity  over  morphological.  Turkish,  for 
example  is  agglutinative  and  uses  extensive  vowel  harmony.  Hence,  words  tend  to 
be  very  long  and  have  a  reduced  number  of  vowel  distinctions,  making  it  necessary 
to  attend  to  consonantal  invariances. 

Due  to  a  number  of  factors  such  as  historical  change  and  linguistic 
preservation  pressures,  almost  all  alphabetic  scripts  have  some  phoneme-grapheme 
mapping  inconsistencies.  When  orthographic  systems  remain  unchanged  for  a  long 
period  while  pronunciations  have  changed  greatly,  some  scripts  (English,  French, 
Icelandic)  become  quite  lexicalized.  That  is,  when  the  phonological  cues  for 
pronunciation  of  the  word  do  not  reliably  predict  the  spelling,  they  must  be 
memorized  logographically.  English  has  many  common  words  like  this: 


been 

though 

enough 

laugh 

friend 
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love,  move 
have 

Also,  many  homophonous  words  are  distinguished  by  their  logographic 
representation  as  in 

Maine,  mane,  main, 
plain,  plane 

Summary  and  Concluding  Remarks 
Orthographic  systems  are  most  often  partial  representations  of  mul  tiple 
levels  of  linguistic  structure.  They  are  alike  in  their  simultaneous  use  of  multiple 
cues  for  word  identification.  Some  combination  of  phonological,  morphological, 
and  lexical  cues  appear  in  all  scripts  with  varying  degrees  of  explicitness. 
Moreover,  scripts  seem  to  share  the  characteristics  of  being  only  partially 
representative  of  speech  and  of  our  underlying  linguistic  knowledge. 

Many  of  the  world's  scripts  have  evolved  from  one  another  and  others  have 
arisen  independently.  There  is  little  doubt  that  particular  orthographies  have  been 
adapted  to  serve  their  particular  languages.  As  a  result,  scripts  are  as  many  and 
varied  as  are  languages.  Yet,  like  the  languages  they  represent,  scripts  are 
constructed  from  and  constrained  by  linguistic  structures  and  our  ability  to  process 
them.  I  believe  that  the  fact  that  diverse  orthographic  systems  share  such 
fundamental  characteristics  is  itself  evidence  of  human  preferences  and  constraints 
on  processing  visual  language. 

The  notion  that  visual  language  processing  is  influenced  by  the  linguistic 
information  presented  in  scripts  is  not  novel  to  the  psychology  of  reading.  What 
has  been  problematic  is  that,  to  a  great  extent,  reading  research  has  assumed  a  priori 
that  alphabetic  scripts  represent  phonemic  systems.  As  a  result,  most  theoretical 
and  experimental  work  has  focused  on  delineating  the  essential  phonemic  features 
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of  the  orthographic  system  and  how  they  are  best  learned.  In  doing  so,  many  of  the 
regularities  present  in  the  system  can  be  overlooked  or  misinterpreted.  An  appraisal 
of  a  variety  of  writing  systems  obliges  us  to  consider  the  idea  that  alphabets,  like 
various  other  scripts,  include  different  types  of  linguistic  information. 
Furthermore,  contrary  to  the  notion  that  a  complete  and  consistent  phonemic  script 
is  necessarily  optimal,  the  fact  that  all  orthographies  are  multileveled  and  incomplete 
indicates  that  there  is  some  processing  advantage  for  information  to  be  structured  in 
this  fashion.  The  following  chapter  will  discuss  psycholinguistic  evidence  that 
readers  of  various  scripts  do  indeed  extract  cues  form  the  various  levels  of  linguistic 
structure  represented  in  orthographies. 

The  most  important  implication  of  these  facts  to  the  overall  discussion  is  that 
learning  the  varied,  often  incomplete  features  of  an  orthographic  system  is  likely  to 
involve  far  more  than  what  is  provided  by  instruction.  Further,  many  of  scripts' 
regularities  are  not  obviously  indicated,  or  complex  enough  so  as  to  not  lend 
themselves  to  conscious  analysis  (particulary  by  young  children).  Consequently, 
orthographies  are  the  kinds  of  systems  that  are  likely  to  be  learned  implicitly. 


CHAPTER 4 
MULTILEVELED  REPRESENTATION-  PROCESSING  EVIDENCE 


Linguistic  taxonomies  of  writing  systems  have  some  interesting  implications 
for  theories  of  visual  processing,  word  recognition  and  reading.  Based  on  the 
assumption  that  humans  will  attend  to  patterned  stimuli  when  faced  with  a  learning 
task,  it  is  highly  likely  that  readers  utilize  the  structure  that  orthographies  present. 
Furthermore,  the  fact  that  multileveled  representation  is  the  rule  across  scripts  and 
script  types  suggests  that  there  is  some  processing  advantage  to  extracting  various 
linguistic  cues  at  once.  Since  spoken  language  is  comprised  of  several  structural 
levels,  it  follows  that  humans  would  be  predisposed  to  interpret  the  written  code  by 
making  connections  to  language  representations  already  in  place.  Moreover,  I 
suggest  that  it  is  precisely  because  of  human  processing  predilections  that  scripts 
tend  to  show  such  remarkable  commonalties.  Hence,  this  chapter  is  a  corollary  to 
the  previous  one.  If  reading  does  involve  interpreting  the  various  aspects  of 
structure  represented  in  scripts,  there  should  be  evidence  of  this  in  the  way  people 
process  written  language.  Since  there  has  been  a  great  deal  of  study  in  regard  to 
phonemic  structure  in  reading,  and  in  an  effort  to  emphasize  the  multi- 
dimensionality  of  visual  processing,  I  will  discuss  empirical  studies  which  point  to 
reader's  use  of  morphological  and  syllable  structure. 

Syllable  Structure  and  Language  Processing 
The  syllable  is  a  constituent  of  phonological  representation.  Though  it  has 
no  conclusive  phonetic  definition,  it  generally  corresponds  to  a  single  bump  in 
sonority  (perceived  loudness),  and  to  the  open-close  cycle  of  the  jaw.  The  syllable 
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is  the  smallest  separately  pronounceable  unit  of  speech  and  perhaps  the  most 
perceptually  salient.    Both  adults  and  young  children  agree  as  to  how  many 
syllables  there  are  in  a  given  form  and  agree  to  a  large  extent  where  the  boundaries 
are.  Furthermore,  spontaneous  speech  errors  affirm  the  reality  of  syllables  as  a 
speech  unit.  Spoonerisms,  where  an  intended  word  is  replaced  by  another,  usually 
have  the  same  number  of  syllables  as  the  intended  word.  Also,  with  the  "tip  of  the 
tongue  phenomena,"  when  searching  for  a  word  in  long-term  memory,  the  words 
tried  usually  have  the  same  number  of  syllables  as  the  sought-for  item  (Brown  and 
McNeill  1966). 

One  widely  accepted  proposal  for  the  internal  structure  of  the  syllable  is  that 
it  is  divided  into  onset  and  rime.  The  onset  is  element(s)  before  the  sonority  pea* . 
The  rime  consists  of  the  nucleus  (which  is  or  includes  the  sonority  peak)  and  coda, 
the  elements  after  the  nucleus. 
Fig.  3.1 

syllable 


onset 


nucleus      coda 
The  division  of  the  syllable  into  onset  and  rime  is  justified  on  linguistic 
grounds.  The  elements  of  the  rime  (nucleus  and  coda)  may  attract  stress.  For 
instance,  in  quantity  sensitive  stress  systems,  a  heavy  syllable  has  something  after 
the  sonority  peak.  Thus  CVV  and  CVC  are  heavy  and  attract  stress  but  CV  and 
CCV  are  not  and  so  do  not  attract  stress.  Elements  of  the  rime  may  be  a  tone 
bearing  unit.  In  some  languages,  a  tone  can  appear  on  the  peak  or  the  element  after 
the  peak  (the  second  segment  of  a  long  vowel  or  a  coda).  Vowel  length  may  be 
affected  by  elements  after  the  peak.  For  example,  in  some  languages,  a  long  vowel 
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can  only  occur  in  an  open  syllable.  Conversely,  onsets  are  not  distinguished  by 
stress  systems,  they  do  not  bear  tone,  nor  do  they  affect  vowel  length. 

There  is  behavioral  evidence  for  this  view  of  syllable  structure  from 
spontaneous  speech  errors.  Transpositions  occur  when  an  intended  sound  is 
replaced  by  a  sound  belonging  elsewhere  in  the  utterance.  In  these  cases,  people 
replace  onsets  with  onsets  and  rimes  with  rimes   (  Fromkin  1973;  MacKay  1972, 
Stemberger  1983).  Or  they  may  transpose  onsets  with  onsets,  peaks  with  peaks 
and  codas  with  codas  (MacNeilage  1985).    Additional  evidence  comes  from 
children's  language  games,  such  as  "pig  latin,"  in  which  segments,  onsets,  rimes, 
or  syllables  are  manipulated. 

There  is  also  experimental  evidence  which  affirms  the  perceptual  reality  of 
the  onset-rime  division  of  the  syllable.  Treiman  (1986)  conducted  experiments 
where  college  students  heard  pairs  of  CVCC  words  (e.g.,  /paskt/  and  /nAts  /) . 
They  were  asked  to  blend  the  two  words  into  one  new  word  by  taking  parts  of 
each.  Over  90%  of  all  responses  were  blended  by  taking  the  onset  on  one  word 
and  the  rime  of  the  other.  For  example,  blending  /paskt/  and  /nAts/  resulted  in  a 
C/VCC  division,  /pAts/  as  opposed  to  CV/CC  /pasts/  or  CVC/C  /pasks/ 

Even  young  children  generally  perform  well  at  phonological  awareness 
tasks  which  manipulate  syllables,  onsets  and  rimes.  For  instance,  in  rhyming  and 
alliteration  production  tasks,  some  very  young  children  (preschoolers)  demonstrate 
knowledge  of  spoken  syllables  at  onset/rime  boundaries  (Maclean,  Bryant  and 
Bradly  1987).   Alliteration  is  defined  as  knowing  when  2  words  share  an  entire 
onset  (e.g.,  bread  and  brick).  Kindergartners  perform  well  on  rhyming  and 
alliteration  production  tasks  (Stanovich,  Cunningham  and  Cramer  1984)  and  forced 
choice  tasks  (e.g.,  what  rhymes  with  bed  ?  sled  or  ring ).     In  another  series  of 
studies  done  by  Treiman  and  Zukowski  (1991),  preschoolers  and  kindergartners 
performed  well  on  word  pair  comparison  tasks  which  demonstrate  knowledge  of 
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onsets  such  as  plank  and  plea  vs.  twist  and  brain.  They  also  demonstrated 
knowledge  of  rime,  spit  and  wit  vs.  rail  and  snap.     In  phoneme  deletion  and 
substitution  tasks,  first  and  second  grade  children  have  a  very  high  rate  of  success 
when  asked  to  manipulate  an  entire  onset  or  rime.  However  the  error  rate  increased 
greatly  when  asked  to  manipulate  only  part  of  the  onset  or  rime  (Bruck  and  Treiman 
1990;  Treiman  1985). 

Syllable  Structure  in  Reading 
The  syllable  and  its  constituent  structure  are  highly  salient  phonological 
units  and  are  crucial  to  theoretical  linguistic  accounts.  This  perceptual  salience  has 
clearly  marked  the  development  of  writing  systems.  Syllabic  scripts  are  the  most 
common  type  of  orthographies  and  often  have  developed  from  alphabets  and 
logographies  (Daniels  1992).  Given  the  perceptual  salience  of  syllables,  it  is  likely 
that  readers  of  alphabetic  scripts  may  use  syllable  structure  to  recognize  words.  In 
fact,  there  is  evidence  from  English  that  this  is  the  case. 

Some  Evidence  for  Use  of  Onset  and  Rime 

As  stated  in  the  previous  chapter,  English  orthography  tends  to  consistently 
represent  onsets  and  rimes-  that  is  the  pronunciations  of  onsets  and  rimes  are  more 
consistent  than  phoneme  pronunciations.  This  is  particularly  true  for  vowel 
representation  in  English  where  every  vowel  symbol  has  multiple  pronunciations. 
However,  if  one  considers  the  letters  which  compose  the  entire  rime,  the 
pronunciation  of  vowel  symbols  are  quite  consistent.  For  instance,  -ast  is  always 
pronounced  /ast/,  where  bla-  can  be  pronounced  variably  as  in  blast,  blame,  blare, 
blah.    Also,  consider  the  vowel  symbol  ea.  It  may  be  pronounced  Id  as  in  steak, 
lit  as  in  beat,  or  Id  as  in  head.    For  words  that  end  in  -earn  and  -each,  ea  is  most 
likely  pronounced  HI.  Yet  for  words  that  end  in  -ead,  ea  is  most  likely  pronounced 
Id  (Treiman  1992).    Consequently,  the  group  of  letters  that  form  the  rime  are  much 
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more  likely  to  be  consistently  identified  and  pronounced  than  are  the  individual 
letters. 

It  has  been  suggested  that  this  consistent  property  of  English  orthography  is 
extracted  during  reading.   Treiman  (1992)  and  Goswami  and  Bryant  (1992) 
propose  that  children  favor  using  orthographic  units  that  correspond  to  onsets  and 
rime  because  they  are  more  perceptually  salient  than  are  segments.  Many  young 
children,  before  they  learn  to  read  are  aware  that  the  spoken  syllable  /blast/  consists 
of  the  units  /bl/  and  test/.  Accordingly,  it  has  been  suggested  that  children  will  tend 
to  use  orthographic  units  that  represent  salient  phonological  units.  As  a  result,  it  is 
likely  that  young  children  will  favor  using  onset  and  rime  constituents  in 
recognizing  and  pronouncing  written  words:  bl  ast,  as  opposed  to  b  last,  bla  st, 
bias  t,  or  even  blast.    In  this  view,  the  correspondence  between  bl  and  /bl/  is 
stored  as  a  separately  memorized  unit  and  can  be  used  whenever  bl  occurs  at  the 
beginning  of  a  word.  Similarly,  -ast  is  recognized  as  a  unit. 

In  fact,  there  is  some  convincing  empirical  evidence  which  suggests  that 
onsets  and  rimes  are  used  in  learning  to  read  English  words.  A  series  of  studies  by 
Goswami  (1986,  1988)  indicated  that  pre-reading  and  beginning  reading  children 
learned  monosyllabic  words  by  making  "analogy"  to  the  onsets  and  rimes  of 
previously  learned  words.    Goswami  performed  a  series  of  experiments  with 
children  from  kindergarten  (prereaders)  to  second  grade.  In  these  tests,  pairs  of 
"clue  words"  which  shared  the  same  orthographic  sequence  were  given.  The  clue 
words,  which  were  pronounced  for  the  children,  shared  either  a  rime  sequence  like 
peak-beak,  or  an  initial  CV(  V)  sequence  like  beak-  bean  .  Then  the  children  were 
asked  to  read  a  series  of  words  and  nonsense  words  that  possibly  shared  either  of 
these  orthographic  sequences  (e.g.,  leak  or  beam  ).  The  results  showed  that  while 
children  made  some  analogies  from  the  orthographic  sequences  of  the  initial  CV(  V), 
all  the  children  were  significantly  more  likely  to  make  analogies  from  the  rimes  of 
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words.  Even  the  prereading  children  were  capable  of  using  the  orthographic  rime 
sequences  to  identify  some  words  (Goswami  1986,  1988). 

Also,  studies  have  indicated  that  both  children  and  adults  are  better  at 
recognizing  nonwords  whose  rimes  are  orthographically  similar  to  real  words.  In  a 
experiment  by  Treiman,  Goswami  and  Bruck  (1990),  subjects  were  asked  to 
pronounce  words  like  tain,  goach,  and  others  like  goon,  taich.  The  first  two 
nonwords  share  their  rime  with  a  number  of  real  words  like  rain,  main,  coach, 
roach.  But  the  last  two  share  their  orthographic  rime  with  few  or  no  real  words.  If 
just  phoneme/grapheme  correspondences  were  being  used,  subjects  should  respond 
equally  well  on  both  types-they  all  contain  the  same  graphemes.  However,  all 
subjects  performed  better  with  nonwords  with  familiar  real  word  orthographic 
rimes,  e.g.,  tain  shares  its  rime  with  many  real  words  such  as  main,  rain,  pain, 
drain,  etc.   The  more  real  words  that  shared  the  orthographic  rime  of  the  nonsense 
word,  the  better  the  subjects'  performance.  The  number  of  real  words  that 
contained  the  nonword's  initial  CV  sequence  had  no  effect . 

Children's  spelling  errors  offer  additional  evidence  of  the  onset-rime 
division  of  the  syllable.  Children  have  difficulty  analyzing  a  complex  onset  into  its 
constituent  phonemes.  Hence,  they  will  often  spell  an  onset  cluster  with  the  initial 
consonant  only  (e.g.,  Wow  is  spelled  bow  and  stay  is  spelled  ray).  Treiman 
(1993)  suggests  that  this  reflects  the  isolable  nature  of  the  syllable  onset.  These 
omissions  do  not  reflect  children's  mispronunciations  since  children  at  this  age  can 
pronounce  the  words  correctly.  Neither  do  these  omissions  reflect  the  sonority  of 
clusters  since  the  second  element  of  a  cluster  was  likely  to  be  deleted  whether  it  be 
an  obstruent  or  sonorant.   Treiman  concludes  that  it  is  likely  that  this  pattern  of 
spelling  errors  reflects  children's  tendency  to  analyze  onsets  as  a  single  unit. 
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Some  Evidence  for  use  of  Syllable  Boundaries 

Orthographic  regularity  "  is  the  predictability  of  letter  patterns  within  a 
word.  Readers  have  been  shown  to  extract  general  information  about  the  frequency 
and  position  letter  sequences,  even  when  they  do  not  refer  to  discrete  linguistic 
units.  For  example,  ng  is  a  regular  bigraph  that  occurs  syllable  finally  and  across 
syllable  boundaries  while  gn  never  occurs  word  initially  or  finally,  and  only  across 
syllable  boundaries.  Taft  (1991)  has  demonstrated  that  lexical  decision  times  to 
nonwords  like  flink,  with  legal  orthographic  strings  are  slower  than  to  random 
strings  like  IJkni,  indicating  that  readers  are  sensitive  to  statistical  regularities  within 
words.  Young  children  (prereading)  who  have  had  print  exposure  will  judge 
sequences  like  AAAAA  as  unreadable,  indicating  that  they  have  already  abstracted 
some  letter  frequency  information  (Sinclair  and  Papandropoulou  1984).  Also, 
Treiman  (1993)  found  that  young  children,  kindergarten  through  second  grade, 
already  have  some  knowledge  of  orthographic  regularities  and  that  this  knowledge 
increases  with  age  and  print  exposure. 

There  is  also  evidence  that  such  orthographic  regularities  mark  syllable 
boundaries,  and  readers  use  this  information  for  recognizing  words.  For  instance, 
when  syllable  boundaries  are  marked  in  the  orthography  by  bigrams  that  only  to 
appear  in  different  syllables  (e.g.,  ABHOR  and  AN  V1L),  readers  are  likely  to 
utilize  this  information  to  recognize  words  (Prinzmetal,  Treiman  and  Rho  1986; 
Seidenberg  1987).  In  these  studies,  a  word  (ANVIL)  is  printed  in  two  colors 
(anVIU«)  and  displayed  tachistoscopically  for  a  duration  that  produces  about  10% 
errors  over  trials.  Subjects'  task  was  to  report  the  color  of  a  target  letter,  e.g.  V. 
The  reasoning  behind  this  method  was  that  if  subjects  use  syllabic  units  during 
recognition,  they  should  not  respond  to  the  target  letterV  incorrectly  with  the  color 


17  Some  times  called  orthographic  redundancy. 

1 8  Different  colors  are  represented  here  by  upper-lower  case 
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of  N  because  the  syllable  boundary  should  act  as  a  barrier  to  feature  integration 
errors.  Conversely,  if  the  display  were  ANVil,  subjects  may  tend  to  report  that  V 
was  the  color  of  U  because  it  was  part  of  the  same  syllable.  Errors  of  the  first  type, 
"violation  errors,"  crossed  the  syllable  boundary;  while  errors  of  the  second  type, 
"preservation  errors,"  preserved  the  syllable  boundaries.  It  is  thought  that  if 
syllables  influence  the  pattern  of  feature  errors  there  should  be  more  preservation 
than  violation  errors. 

Prinzmetal  et  al.  ( 1986)  who  found  that  four  of  five  experiments  yielded 
more  preservation  than  violation  errors,  concluded  that  syllables  do  influence  word 
identification.  However,  the  stimuli  in  the  four  experiments  that  show  syllable 
effects  were  words  in  which  the  syllable  boundary  bigrams  only  appear  in  different 
syllables  (e.g.,  AN  V1L),  or  where  syllable  and  morphological  boundaries  coincide 
(e.g.,  TODAY).   The  experiment  that  did  not  produce  the  syllable  effect  used 
words  where  the  syllable  boundary  was  not  marked  orthographically.  For 
example,  the  letters  that  mark  CAMEL  and  SA  LAD  do  not  uniquely  occur  at 
syllable  boundaries.  These  results  were  replicated  by  Seidenberg  (1987).  He 
conducted  similar  experiments  where  he  also  found  that  readers  are  aware  of 
syllable  boundaries  where  they  are  marked  by  orthographic  patterns". 


W<lltatTr8  SE^iS  thiS  iS  true  n0t  just  in  words  Ilke  ANVIL  whe«  «■  pliable  boundary 
bigrams  only  appear  in  different  syllables,  but  also  words  like  LAPEL  and  SONIC  where  if  ycT 

^M?£CTl      bl?T  IJIT^  P"*«ling,  Addling  and  following  the  syllable  boundary 
rfS,         f^T  "*  mdlCaled   Tha' iS'  Sy"able  inf°™alion  can  be  induid  from  the  overall 
SSSfi!  J         T™'  "0l  JUSl  *•  Sy"able  b0™^"-  Hence,  Seidenberg  concluded  that 
win tu^dTT  m  °nh0«raPbjeS  fa"S  °Ut  0f  ktto  [K1»™y  Pa«<™  S  general  that  reade" 
w.U  attend  to  all  recurring  patterns  However,  I  believe  there  are  some  problems  with  drawing  such 
conclusions  from  this  experiment.  First,  the  stimuli  include  5-letter  words  where  Seid™/ 
assumes  that  the  syllable  boundary  is  after  the  second  (LA/PEL)  or  third  (SON/IC)  lette     This 
syllabic  division  of  words  Hke  SONIC  is  arguable-i,  can  be  considered  LbisyUabl where dL 
is  no  real  boundary  between  the  syllables    Also,  the  enure  stimuli  list  ,s  no.  pven  so  it  is  unclear 

(Tfe  fSpED  wh!SenUally  PlaHym8  T1^  "■  ^  ^  CV/CVC  words  A  be  °^ S 
hke  LAPEL),which  may  give  the  reader  additional  cues  about  the  syllable  boundaries   Hence 
subjects  awareness  of  syllable  boundaries  in  stress-final  words  like  LAPEL  may  be  influenced  by 
phonological  factors  (e.g.,  stress),  not  simply  letter  pattern  frequencies  * 
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Morphological  Structure  and  Language  Processing 
Though  discussions  about  the  nature  of  lexical  representations  is  ongoing, 
morphological  structures  have  been  central  to  linguistic  accounts.  Despite  debates 
about  whether  morphological  structures  are  represented  in  a  "lexicon"  and 
assembled  via  rules  or  are  governed  strictly  by  syntactic  principles,  it  is  generally 
agreed  that  morphemes  are  the  elements  of  word  formation  and  interact  with 
syntactic  and  phonological  processes.  There  is  visible  evidence  of  speaker's 
knowledge  of  morphology  from  speech  production  and  perception.  Native 
speakers  know  the  system,  (e.g.,  derivation,  inflection,  compounding)  for 
combining  morphemes  into  words.  Children  and  adults  coin  and  understand  new 
words  using  productive  morphology.  Spontaneous  speech  errors  involve 
transposition  of  morphemes  (Stemberger  1983). 

In  considering  morphology  from  a  processing  perspective,  Bybee  (1995) 
points  out  that  morphological  form  and  meaning  are  a  consequence  of  human 
processing  capacities.  For  instance,  in  the  context  of  word  formation  and  language 
change,  in  the  gradual  progression  from  a  lexical  morpheme  to  grammatical  one, 
changes  occur  in  that  morpheme's  phonological  shape,  meaning,  and  grammatical 
behavior.    It  is  because  bound  morphemes  are  processed  differently  that  they 
develop  properties  that  distinguish  them  from  independent  words.  In  this  sense, 
morphological  structures  are  themselves  evidence  of  morphological  processing.  I 
believe  that  there  is  an  analogous  point  for  visual  processing.  Orthographies 
indicate  morphological  structures  in  various  ways  because  morphology  is  a 
component  of  visual  language  processing. 

Yet,  theoretical  accounts  of  word  recognition  and  reading  have  tended  to 
minimize  the  role  of  the  morpheme.  Reading  theories  such  as  phonemic  receding, 
whole  word,  and  dual  route  theories  generally  maintain  that  words  are  identified 
either  by  reference  to  individual  phonemes  or  whole  words.  Similarly, 
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psycholinguistic  word  recognition  models  that  have  been  proposed  often  discount 
morphological  processes.  For  instance,  in  Search  Models  such  as  Forster's 
( 1976),  initial  letters  of  a  word  are  used  to  identify  a  'bin'  of  similarly  spelled 
words  and  the  bin  is  searched  serially  for  an  element  which  is  sufficiently  similar  to 
the  stimulus  word  until  a  match  is  found.  Since  such  models  predict  the  first  few 
letters  of  a  word  will  be  effective  to  access  the  lexicon,  they  assume  whatever 
phonological  or  morphological  information  is  encoded  at  the  ends  of  words,  is  not 
relevant  to  word  recognition.  A  somewhat  similar  treatment  of  word  recognition  in 
speech  perception  is  Marslen-Wilson's  (1980)  Cohort  Model.    Cohort  theory 
asserts  that  stimulus  information  is  used  from  the  beginning  of  a  word  onwards  to 
reduce  uncertainty  until  word  identification  is  made.  That  is,  starting  from  the 
beginning  of  a  word,  when  the  word  becomes  uniquely  distinguishable  from  all 
other  words  in  the  language,  identification  is  made.  Again,  this  model  specify 
different  treatment  for  morphologically  complex  forms,  nor  does  not  define  how 
word-final  morphological  information  plays  into  rapid  identification. 

Still,  there  have  been  some  models  that  have  attempted  to  specify  a  level  of 
morphological  processes  in  word  recognition.  In  Decomposition  models  (Taft  and 
Forster  1975,  Chialant  and  Caramazza  1995)  lexical  representations  consist  of 
morpheme  sized  units,  or  both  morpheme  and  word  sized  units.  The  challenge  for 
this  type  of  model  is  to  accommodate  the  different  effects  that  are  found  in  the 
processing  of  regular  and  irregular  complex  words  (e.g. ,  walked  and  went).  Also 
problematic  for  decomposition  models  is  that  morphologically  complex  forms  that 
are  very  frequent  fail  to  show  significant  morphology  effects. 

Interactive  (spreading)  activation  models,  such  as  Seidenberg's  (1987)  and 
Seidenberg  and  McClelland  (1989),  address  the  issue  of  words'  frequency  effecting 
recognition  regardless  of  their  complexity.   In  this  model,  the  more  frequently 
connections  are  activated,  the  stronger  the  connections.  Hence,  high  frequency 
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items  need  less  activation  than  low  frequency  ones  to  be  accessed  and  are 
recognized  more  quickly.  However,  in  these  models,  morphological  structures  are 
not  directly  represented  but  are  merely  (possibly)  a  byproduct  of  the  transitional 
probabilities  of  letter  sequences. 

A  somewhat  different  attempt  to  account  for  morphological  processes  in 
word  recognition  is  Taft's  (1979,  1987)  BOSS-Basic  Orthographic  and  Syllabic 
Structure-  model.  He  proposes  that  readers  use  access  units  called  BOSSes, 
which  synthesize  syllabic  and  morphological  information  without  necessarily 
referring  to  a  syllable  or  a  morpheme  per  se.  A  BOSS  is  the  first  part  of  the  stem 
morpheme  of  a  word,  up  to  and  including  all  consonants  following  the  first  vowel, 
but  without  creating  an  illegal  consonant  cluster  in  its  final  position,  e.g., 
FAST/ER.  Empirical  studies  which  have  tried  to  assess  the  perceptual  reality  of 
BOSSes  have  had  conflicting  interpretations  (Taft  1979,1987;  Seidenberg  1987; 
Lima  and  Pollatsek  1983). 

Morphological  Structure  in  Reading 
Perhaps  the  most  important  (and  testable)  prediction  the  various  reading 
theories  and  word  recognition  models  make  is  that  there  should  be  (or  not)  some 
differences  that  show  up  in  the  processing  of  morphologically  simple  versus 
morphologically  complex  forms.  For  instance,  for  models  that  do  not  specify 
morphological  processing,  there  either  should  be  no  difference  between  processing 
of  simple  and  morphologically  complex  forms  or  such  differences  should  be 
accountable  for  by  other  means.  Conversely,  if  morphology  is  functional  in  word 
recognition  we  would  expect  to  see  differences  in  the  way  people  process 
morphologically  simple  versus  complex  words.  Some  empirical  studies  have 
attempted  to  discover  whether  morphology  does  influence  the  processing  of  written 
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words.  The  following  is  a  brief  survey  of  studies  that  have  been  taken  as  evidence 

for  visual  morphological  processing. 

Some  Experimental  Evidence  of  Morphological  Processing 

Priming  effects  have  been  taken  as  indicators  of  the  role  of  morphology  in 
processing  written  language.  Feldman  ( 1987)  has  presented  evidence  that  Serbo- 
Croatian  nouns,  which  can  be  inflected  by  affixation  to  six  cases  (singular  and 
plural)  other  than  the  nominative  and  form  derived  forms  by  adding  a  diminutive 
affix,  show  priming  effects.  For  example,  diminutives  can  be  formed  from  base 
forms  such  as  start  "apartment"  and  kora  "crust"  by  adding  one  of  the  suffixes  - 
cic,  -tea,  -ence,  -ak  (e.g.,  stancic,  korica ),  where  choice  of  suffix  is  constrained 
by  gender  of  the  noun.    This  was  a  repetition  priming  experiment  where  base 
forms  are  targets,  they  are  preceded  (7-13  items  earlier)  by  a  prime  which  was 
either  the  identical  word  or  a  morphologically  complex  form.  In  this  lexical 
decision  task,20  primes  were  either  the  base  forms,  the  diminutive  form,  or  a 
"psuedodiminuative  "-an  unrelated  monomorphemic  word  whose  construction 
inappropriately  suggests  that  it  contains  the  same  base  form  plus  a  diminutive  affix 
(stanica,  korak).  The  results  indicated  that  decision  latency  to  the  target  was 
function  of  which  type  of  prime  preceded.  Both  base  word  and  diminutive  primes 
significantly  reduced  target  decision  latencies  while  pseudodiminutives  has  no  such 
effect.  That  is,  subjects  recognized  the  targets  as  words  significantly  faster  when 
they  were  primed  by  morphologically  related  forms.  Hence,  these  results  show 
significant  facilitation  for  morphological  relatives  and  no  facilitation  for 
phonologically  and  orthographically  similar  but  unrelated  words. 

In  another  priming  experiment,  Laudanna  et  al.  ( 1989)  presented  evidence 
from  Italian  that  suggested  that  morphologically  related  words  will  prime  each  other 
while  merely  orthographically  similar  ones  do  not.   In  a  condition  where  the  degree 


1  Lexical  decision  tasks  involve  the  subjects  deciding  whether  the  target  is  a  word  or  not. 
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of  orthographic  similarity  between  words  are  controlled,  only  prior  presentation  of 
a  morphologically  related  word  such  as  portare  ,  "to  bring"  and  not  merely  a 
orthographically  similar  word  such  as  pone ,  "doors",  affects  lexical  decision 
performance  for  a  target  word  porta  ,  "bring."  Lexical  decision  for  such  pairs  is 
faster  when  prime  and  target  are  morphologically  related.  That  is,  the  presentation 
of  portare,  "to  bring"  facilitates  recognition  of  porta  , "  (you)  bring"  but  not  porte 
"doors,"  even  though  they  are  all  visually  similar.   This  is  taken  to  indicate  that 
lexical  access  and  storage  is  not  determined  simply  by  orthography  or  phonology, 
but  also  morphology. 

Evidence  suggests  (Caramazza  et  al.  1988  in  Chialant  and  Caramazza  1995) 
that  lexical  decision  is  significantly  effected  by  words'  payability  into  morphemes. 
In  an  experiment  where  orthographic  similarity  was  controlled  and  word  frequency 
was  matched,  it  was  found  that  subjects  are  slower  and  produce  more  errors  when 
rejecting  non  words  that  can  be  parsed  into  actual  root  and  affixes.  For  example, 
Italian  cant-evi  and  English  walk-est  are  non  words  that  are  composed  of 
morphemes  that  do  occur  in  the  respective  languages.  In  a  lexical  decision  task, 
subjects  are  much  slower  rejecting  such  forms  as  non  words,  than  when  rejecting  a 
non  word  that  contains  a  pseudomorpheme  (e.g.,  Italian  cant-ovi  and  English 
walk-ost).  This  is  interpreted  to  mean  that  subjects  are  attempting  to  use  whatever 
morphological  information  that  is  present  to  identify  the  word.  As  a  result,  it  is 
easier  to  identify  nonwords  constructed  with  pseudomorphemes.  Accordingly, 
subjects  were  faster  still  when  nonwords  contain  no  morpheme-like  sequences  at  all 
(e.g.  canz-ovi,  wilk-ost),  indicating  that  when  there  are  no  recognizable  morphemes 
in  a  pseudoword,  subjects  easily  reject  them  as  word  options. 

In  a  segment  shifting  task  developed  by  Feldman,  Stoltz  and  Feldman 
(1995)  compare  subjects'  ability  to  segment  words  along  morphemic  boundaries 
with  their  ability  to  break  up  unitary  morphemes.  This  method  was  an  attempt  to 
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simulate  spontaneous  speech  errors  (where  morphemes  are  transposed)  which  are 
taken  to  be  evidence  of  morphological  processing  in  speech  production.  The  task  is 
to  take  part  of  the  source  word  and  attach  it  to  the  target,  forming  a  morphologically 
complex  word  and  to  name  the  resulting  word  as  rapidly  as  possible.  For  example, 
the  source  word  is  either  harden_  or  garden  while  the  target  word  is  bright.  The 
dependent  variable  is  the  time  necessary  to  shift  the  designated  segment  -en  from 
the  source  to  the  target,  bright~>brighten.  If  it  takes  longer  to  attach  the 
morpheme-en  to  the  target,  bright ,  after  being  primed  with  the  single  morpheme 
garden  than  it  does  after  the  morphologically  complex  liarden,  then  this  argues  for 
morphological  decomposition  in  processing.  Results  indicated  that  the 
morphologically  complex  source  words  (e.g.,  harden,  darken,  soften)  produced 
faster  shifting  latencies  than  simple  source  words  (e.g.,  garden )  even  though  the 
complex  words  had  a  lower  average  frequency.  Furthermore,  source  words  were 
selected  so  that  several  letters  close  to  the  shifted  portion,  along  with  the  shifted 
portion,  represented  the  same  phonemes.  Therefore,  it  is  unlikely  the  outcome 
reflects  phonological  factors,  but  instead  that  readers  are  sensitive  to  morphological 
structure.  A  similar  study  of  Serbo-Croatian  was  done  by  Feldman  (1994)  with 
similar  results. 

So  far  evidence  of  morphological  processing  has  been  with  languages 
(Italian,  English,  Serbo  -Croatian)  in  which  complex  words  are  constructed  by 
appending  affixes  to  a  stem,  which  is  often  a  word  itself.  A  study  by  Bentin  and 
Frost  (1995)  replicates  Feldman's  segment  shifting  experiment  with  Hebrew,  a 
language  where  words  are  composed  of  "roots"  +  "word  patterns"  which  are 
productive  but  are  nonconcatenative.  Roots  are  composed  of  a  series  of 
consonants.  Word  patterns  are  vowel-melodies  which  "break  up"  the  consonantal 
root  Words  are  formed  by  "mounting"  word  pattern  morphemes  onto  the  root* 

21  The  authors  are  considering  the  word  pattern  as  well  as  the  root,  which  usually  represents  a 
general  concept,  morphemes. 
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They  hypothesize  that  if  morphological  effects  generalize  across  different  types  of 
morphologies,  and  over  segments  that  are  not  final  syllables,  then  morphology 
effects  would  show  up  with  Hebrew  as  well  as  English  and  Serbo-Croatian.  That 
is,  if  shifting  infixed  word  patterns  show  the  same  effects  as  shifting  concatenative 
suffixes,  this  would  indicate  an  effect  of  morphological  organization  on  parsing 
words  that  cannot  be  attributed  to  simple  visual  analysis  of  the  letter  string22.   In 
this  experiment  both  C's  and  V's  are  represented,  V's  either  by  diacritics  or  letters. 
For  example,  a  source  word  might  be  (transliterated)  moked  .  The  task  is  to  extract 
the  vowel-morpheme  [-o-e-]  from  the  root  [m-k-d],  and  to  combine  it  with  a  target 
string  of  consonants23  as  quickly  as  possible  in  order  to  form  a  new  word.  There 
are  2  types  of  roots:  1)  Productive  roots  that  can  attach  with  several  different  word 
pattern  morphemes  and  2)  non-productive  roots  which  can  only  have  one  word 
pattern  morpheme  (indicating  it  is  not  decomposed  morphologically).   The  authors 
reason  that  if  reading  in  Hebrew  employs  morphological  analysis,  it  should  be 
easier  to  detach  word  patterns  from  productive  roots  than  from  non-productive 
roots.  The  results  indicated  that  subjects  were  much  faster  with  productive  root 
primes  than  with  non-productive.  The  authors  concluded  that  in  Hebrew  word 
identification,  roots  and  word  patterns  are  psychologically  distinct,  and  also  that 
there  is  a  stage  where  they  are  separated  during  word  identification.  This  is 
interesting  since  generally  in  connected  text  word  patterns  are  not  represented  at  all 
and  must  be  supplied  by  the  reader.  Hence,  it  would  seem  that  Hebrew 
orthography  would  oblige  morphological  decomposition  and  word  pattern 
morphemes  would  be  inherently  separable  from  their  root.  However,  these  data 


22  Some  researchers  (e.g.,  Seidenberg's  1987  position  on  (his  has  been  discussed)  assert  that 
extracting  morphological  patterns  are  simply  a  byproduct  of  visual  analysis  of  the  phonemic  letter 
stang,  NOT  that  morphology  has  an  effect  on  parsing  words. 

The  target  string  of  consonants  was  not  a  meaningful  root   Therefore,  mounting  the  word 
pattern  on  it  resulted  in  forming  a  pseudoword.  The  authors  argue  that  this  is  procedure  insured 
that  the  results  could  not  be  accounted  for  by  other  factors  such  as  similarity  or  dissimilarity  in 
word  class  between  the  source  and  the  target  (Bentin  and  Frost  1995). 
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indicated  that  non-productive  words,  though  written  as  though  roots  and  word 
patterns  are  distinct,  are  accessed  as  whole  lexical  items.  In  general,  this  seems  to 
indicate  that  there  is  some  kind  of  processing  distinction  for  bound  versus  free 
morphemes/words. 

Logographic  scripts,  where  morphemes  are  explicitly  represented,  would 
seem  to  necessitate  morphological  analysis  by  the  reader.   Taft  and  Zhu  (1995) 
investigate  the  status  of  morphemes  vs.  whole  word  representations  in  Chinese 
writing  with  a  series  of  experiments  using  "binding  words."  Chinese  characters  are 
usually  said  to  be  equivalent  of  single  morphemes.  Each  word  is  made  up  of  one  or 
more  (usually  two)  characters/morphemes,  where  each  character  is  usually 
monosyllabic.    For  example,  the  character  for  mil  means  "cleanse"  and  yu  means 
"bathe."  The  two  characters  together,  mu  yu  mean  "bath".     However,  not  all 
characters  are  free  morphemes.  Some  characters  can  only  occur  with  other 
characters  to  form  compounds  or  "binding  words."  For  example:  Hao   means 
"vast"  but  is  never  used  on  its  own,  to  use  it  must  be  combined  with  something  like 
hao  da  where  da  means  "big."  All  characters  are  spatially  distinct  so  there  is  no 
way  to  tell  orthographically  if  they  are  biding  words  or  not.  Certain  characters  in 
binding  words  only  occur  in  first  position,  certain  others  only  in  second  position. 
Further,  some  binding  word  morphemes  are  unique  (non-productive)  in  that  they 
occur  only  in  a  single  word,  while  others  can  occur  in  several  words. 

In  one  experiment  which  used  a  naming  task,  subjects  were  faster  at  naming 
an  isolated  first  morpheme  of  a  binding  word  than  the  second.  The  authors  initially 
conclude  from  this  that  the  morphemes  of  binding  words  do  not  have  their  own 
lexical  representation  and  the  whole  word  must  be  accessed.  In  another  experiment 
the  authors  test  the  idea  that  a  morpheme's  productivity  determines  whether  or  not  it 
is  independently  represented  .  They  found  that  non-unique  binding  morphemes 
(those  that  occur  in  several  words)  were  more  easily  accessed  than  those  that 
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occurred  in  just  one  word.  They  conclude  from  this  that  characters  develop 
independent  representations  when  their  status  as  a  morpheme  has  been  fully 
established-which  involves  each  morpheme/character  developing  of  a  semantic 
function.  Hence,  characters  that  are  semantically  independent  are  taken  to  be 
independent  morphemes  and  are  recognized  as  such.   Other  characters  that  are 
semantically  bound  to  another  character  can  only  be  recognized  by  accessing  the 
whole  word.  It  appears  that  in  Chinese,  like  Hebrew,  words  that  are  ostensibly 
polymorphemic  but  non  productive  seem  to  be  represented  as  wholes.  Hence,  a 
conservative  conclusion  would  be  that  in  general,  bound  morphemes  seem  to  have  a 
different  psychological  status24  than  free  morphemes  and  that  difference  shows  up 
in  visual  processing. 

Evidence  of  Morphological  Processing  from  Children's  Spelling 

In  a  study  that  followed  two  classes  of  first  grade  children  over  the  school 
year,  Treiman  (1993)  found  that  children  are  significantly  more  likely  to  omit  the 
final  consonant  from  a  word  when  it  is  a  regular  inflectional  morpheme  than  they 
are  if  the  word  is  monomorphemic  or  is  an  irregularly  inflected  word.  For 
example,  killed  and  build  are  similar  phonologically  but  children  are  more  likely  to 
omit  the  -ed  from  killed  than  the  d  from  build.    Similarly,  children  were  less 
likely  to  omit  the  final  consonant  from  irregularly  inflected  words  like  found  or 
feet,  than  they  are  from  killed.  Omissions  are  most  common  when  the  suffix  can  be 
predicted  from  the  stem,  subject  to  regular  inflection.  This  error  pattern  is 
potentially  a  result  of  children's  recognition  of  morphological  boundaries.  This  is 
particularly  likely  because  no  phonological  reason  was  found  for  this.  For 
example,  it  was  not  the  case  that  they  omitted  final  C's  to  avoid  consonant  clusters 
because  children  did  not,  as  a  rule,  do  this  anyway.  Nor  does  the  duration  of  the 

24  The  psychological  status  of  bound  versus  free  morphemes  should  depend  on  a  number  of  factors 
such  as  productivity,  syntactic  application,  morphological  typology  of  the  particular  language, 
e.g.,  isolating,  agglutinative,  fusional,  and  so  on. 
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final  C's  explain  this  omission.    Since  in  adult  speech  final  consonants  are  actually 
longer  when  they  are  inflectional  suffixes,  one  would  expect  them  to  be  more  likely 
to  be  represented,  which  they  are  not.  Treiman  suggests  that  a  possible  explanation 
for  this  morpheme  omission  is  that  children  may  forget  they  are  writing  the  past 
tense  since  they  write  so  slowly.  However,  she  notes  that  when  reading  back  what 
the  child  has  written  about  yesterday,  the  child  will  say  HAvd  I  where  she  has 
written  love.  This  tendency  seems  to  indicate  at  least  that  it  is  clear  to  the  children 
that  the  word  is  recognizable  without  the  inflection,  that  inflection  is  a  separable 
something  extra,  the  necessity  of  which  is  not  yet  entirely  clear. 

Some  Evidence  from  the  Evaluation  of  the  Reading  Impaired 

Reading  disabled  subjects  (classified  dyslexic)  very  often  show  significantly 
poorer  oral,  as  well  as  written,  language  processing  than  do  normal  readers.  This 
fact  has  often  been  hypothesized  to  mean  that  dyslexia  is  some  kind  of  general 
language  disorder.  Therefore,  it  is  an  impairment  that  effects  both  the  auditory  and 
visual  modalities.  It  is  further  hypothesized  that  this  indicates  a  link  between 
phonological  and  visual  processing  and  is  taken  as  evidence  that  reading  involves 
receding  of  phonological  structure.  However,  the  scope  of  oral  language  problems 
that  are  associated  with  dyslexics  suggests  that  linguistic  deficits  beyond 
phonological  are  present.    In  some  current  work  by  Lombardino  ,  Riccio,  and 
Hynd  (in  press),  reading  disabled  (high  IQ)  subjects  performed  significantly  more 
poorly  on  an  evaluation  of  language  fundamentals  in  all  areas  of  language 
processing  than  did  normal  readers.  The  evaluations  were  presented  orally.  The 
areas  tested  were  subjects'  knowledge  of  word  class,  semantic  relationships, 
sentence  formulation,  sentence  recall,  sentence  assembly  and  word  structure 
(simple  derivation  and  inflection).   If  the  hypothesis  that  dyslexia  is  a  general 
language  disorder  and  the  auditory  and  visual  modalities  are  both  functional  in 
reading  is  correct,  then  this  seems  to  be  evidence  that  interpreting  multiple 
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linguistic  levels  is  pan  of  the  reading  process.   That  is,  this  disorder  is  effecting  all 

(tested)  areas  of  language  processing  and  its  effects  are  showing  up  in  both  reading 

and  speech  perception  and  production. 

The  notion  that  dyslexia  is  a  whole  language  learning  disorder  points  away 

from  the  hypothesis  that  dyslexia  is  particularly  a  result  of  some  phonological 

weakness.  Scarborough  (1991)  suggests  that  oral  language  processing  deficits 

often  precede  reading  disabilities  and  these  oral  deficiencies  show  up  in  syntactic, 

morphological  as  well  as  phonological  abilities  of  preschool  children.  Scarborough 

followed  the  development  of  a  group  of  children  from  two  to  eight  years  in  order  to 

compare  the  development  of  children  who  became  reading  disabled  with  those  who 

became  normal  readers.  The  children  in  these  groups  were  evaluated  at  ages  30, 

36, 42, 48  and  60  months,  at  which  times  various  tests  and  naturalistic 

observations  were  recorded.  These  evaluations  involved  measuring  sentence 

comprehension,  grammatical  complexity,  and  measuring  the  length  of  utterances 

by  counting  morphemes  and  productive  compounds.  The  results  indicated  that 

"syntactic  abilities""  at  age  30  months  were  predictive  of  Grade  2  reading  ability. 

Dyslexic  children  showed  deficiencies  in  sentence  comprehension  and  syntactic  and 

morphological  complexity  throughout  much  of  the  preschool  period  that  normal 

reading  children  did  not.  However,  by  age  5,  oral  language  difficulties  were  no 

longer  evident  by  the  tests  given.  This  is  the  potential  result  of  a  number  of  factors. 

Scarborough  notes  that  one  of  which  may  be  simply  that  these  tests  were 

insufficiently  sensitive  to  the  linguistic  proficiency  of  older  (5  and  over)  children. 

The  point  relevant  to  the  present  discussion  is  that  this  study  indicates  that  dyslexia 

is  not  simply  a  phonological  disorder  or  a  reading  disorder.  Instead,  it  seems  to  be 

a  developmental  language  disorder  that  affects  multiple  levels  of  linguistic 

processing.  The  fact  that  this  disorder  significantly  influences  reading  ability  is 

25  Syntactic  abilities  defined  and  measured  by  the  Northwestern  Syntax  Screening  Test  (sentence 
comprehend),  the  Index  of  Productive  Syntax  and  the  Mean  Length  of  Utterance  evatation 
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evidence  that  reading  involves  multi-leved  linguistic  interpretation.    That  this 
disorder  is  more  problematic  for  reading  than  for  speaking  is  a  complex  issue  that 
has  not  been  resolved. 

Neurobiology  of  Multileveled  Processing 
Evidence  from  the  Neuroanatomy  of  Dyslexics 

I  have  argued  that  there  must  be  a  functional  reason  that  orthographies  are 

organized  to  include  several  different  types  of  information  at  once,  and  also  that  it 

appears  that  readers  interpret  scripts'  many  different  properties  simultaneously. 

Neurobiological  research  indicates  that  there  is  no  single  mechanism  in  the  human 

brain  that  is  able  to  extract  all  the  information  from  an  auditory  or  visual  stimulus. 

Hence,  the  stimulus  is  broken  up  and  little  bits  of  the  information  are  processed  by 

different  subsytems  and  along  different  pathways  Galaburda  (1996)  contends  that 

this  general  property  of  the  brain  applies  to  processing  linguistic  stimuli.   That  is, 

we  are  at  all  times  splitting  up  the  stimulus  in  order  to  analyze  its  particularly 

different  properties.  We  do  this  at  various  stages,  and  at  various  levels,  and  along 

different  pathways,  which  extract  different  kinds  of  information  from  the  stimulus. 

Information  comes  into  the  system  in  a  bottom  up  fashion  but  top-down 

mechanisms  adjust  the  attentional  systems  in  order  to  regulate  the  incoming 

information  and  make  expectations  about  what  may  or  may  not  be  coming  in. 

Galaburda  postulates  that  reading  involves  mental  functions  of  various  types, 

which  function  on  many  levels  and  pathways  at  the  same  time.  Hence,  to  have 

clinically  evident  dyslexia  multiple  systems  would  have  to  be  impaired,  at  different 

stages  and  along  different  pathways.  This  is  widely  agreed  by  researchers  in  the 

field. 

In  fact,  there  is  evidence  that  there  are  anomalies  in  both  perceptual  and 
cognitive  systems  in  dyslexics.  Dyslexics  have  fewer  margnocells,  which  process 
rapid  information  and  handle  location  and  orientation  in  space.  There  is  a  delay  of 
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this  type  of  low  level,  physical  information  into  the  brain  of  the  dyslexic,  hence, 
dyslexics  are  slower  at  this  type  of  perception.    That  is,  they  show  a  slowing  down 
of  transmission  of  the  stimulus  before  it  ever  begins  to  be  analyzed  linguistically  or 
otherwise.  Dyslexics  also  show  neurophysiological  anomalies  in  their  cognitive 
systems.  There  is  evidence  of  phonological  and  syntactic  problems,  problems  with 
verbal  memory  and  attention. 

Galaburda  concludes  that  multiple  processing  systems  are  involved  in  the 
neurobiology  of  dyslexia  He  suggests  this  is  the  result  of  the  connectivity  between 
different  systems  and  pathways  in  the  brain.  At  the  neurophysiological  level,  this 
can  be  seen  with  anomalies  that  begin  in  one  place  in  the  brain  and  propagate  to 
other  areas.  Due  to  the  connections  between  the  frontal  and  temporal  cortex 
(cognitive  processors)  and  the  visual  and  auditory  areas  of  the  thalamus  (perceptual 
processors),  damage  in  one  area  will  cause  problems  in  the  other.    Hence, 
anomalies  occurring  within  both  perceptual  and  cognitive  systems  result  in  effecting 
the  other. 

Summary  and  Concluding  Remarks 
The  empirical  evidence  reviewed  in  this  chapter  points  to  the  fact  that 
readers  do  make  use  of  the  various  linguistic  structures  encoded  in  scripts.  In 
particular,  studies  indicate  that  morphemes  and  syllables  as  well  as  segments  play  a 
role  in  visual  processing.  Moreover,  this  processing  evidence  is  simply  counterpart 
to  the  fact  that  scripts  represent  multiple  aspects  of  linguistic  systems.  In  spoken 
language,  phonemic  segments,  syllables,  onsets,  rimes,  morphemes  and  words  are 
all  elements  of  linguistic  representations  that  are  active  simultaneously.  It  is 
unsurprising  then  that  orthographies,  whose  function  is  to  represent  spoken 
language,  reflect  these  components  and  further,  that  readers  make  use  of  them  for 
interpretation. 
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The  facts  of  multileveled  representation  and  processing  have  several 
implications  that  are  relevant  to  reading  acquisition.  The  most  important  implication 
to  this  discussion  is  that  the  diversity  and  number  of  cues  readers  extract  from  the 
signal  must  effect  the  learning  process.  Orthographic  cues  are  several  and  of 
several  types.  As  discussed,  scripts  typically  cue  several  aspects  of  the  spoken 
language  simultaneously.  Also  important  is  that  while  scripts  represent  many 
things  at  once  they  do  so  only  partially.  That  is,  they  do  not,  probably  can  not, 
represent  all  the  information  in  the  spoken  signal.  Additionally,  information  about 
the  linguistic  structures  presented  in  scripts  is  not  uniformly  obvious.  For  example, 
linguistic  cues  may  sometimes  be  in  the  form  of  discrete  units,  e.g.,  a 
segment/letter,  a  syllable/syllabogram,  or  a  morpheme/logogram.  Yet,  very  often, 
units  such  as  segments,  syllables  and  so  on,  as  well  as  larger  units  like  words,  are 
indicated  by  grapheme  patterns.  Sometimes,  information  about  a  particular  unit  is 
indicated  by  cues  outside  the  boundary  of  the  unit  itself.  For  example,  double 
consonants  in  English  script  give  information  about  a  preceding  vowel  and  often 
across  a  syllable  boundary,  e.g.,  a-pple  .  Information  about  morpheme  and 
syllable  boundaries  are  reflected  by  letter  sequence  frequencies.  Cues  are  often 
overlapping  and  sometimes  conflicting.  For  example,  morpheme  and  syllable 
boundaries  may  overlap  as  in  pre-view  or  they  may  conflict  as  in  press-ing  and 
pressing. 

Scripts  have  adapted  to  serve  a  communicative  purpose.  Hence,  the 
organization  of  information  in  scripts  is  unlikely  to  have  arisen  by  accident.  Instead 
it  is  likely  that  orthographic  organization  is  functional  and  there  is  some  processing 
advantage  for  information  to  be  structured  in  this  fashion.   Indeed,  it  stands  to 
reason  that  it  is  because  of  processing  constraints  that  scripts  have  the  formal 
properties  that  they  do.  The  final  question  is  then,  what  are  these  constraints  and 
how  do  they  influence  learning?  That  is,  what  type  of  learning  system  can 
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accommodate  the  facts  (as  we  know  them  so  far)  of  multileveled  representation  and 
processing.  Learning  an  orthographic  system  involves  attending  to  a  set  of 
complex  and  often  diverse  cues.  Furthermore,  reading  instruction  most  often 
provides  learners  with  only  a  subset  of  the  relationships  they  must  internalize. 
Hence,  I  suggest  that  reading  involves  learning  mechanisms  --  implicit  learning  -- 
that  are  particularly  "good  at"  processing  information  that  is  complex,  incomplete, 
and  at  least  partly  unavailable  to  explicit,  conscious  learning  strategies. 


CHAPTER 5 
IMPLICIT  LEARNING 

Traditional  methods  in  education  presume  that  learning  is  an  intentional 
process  that  involves  a  conscious  awareness  of  the  nature  of  the  stimulus 
environment  and  the  application  of  explicit  attention  to  problem  solving  tasks. 
Likewise,  traditional  models  of  cognition  (Newell  and  Simon  1972)  posit  an 
explicit  information  processing  model  of  human  learning,  where  knowledge  is 
sequentially  constructed  by  goal-directed  processes.  Explicit  cognitive  processes 
include  reasoning,  problem  solving,  overt  recall  from  long  and  short  term  memory, 
awareness  of  self,  and  so  on.    However,  information  that  is  complex,  large  in 
quantity,  and  for  which  stimulus  cues  are  incomplete  or  conflicting  is  not  easily 
apprehended  by  explicit  processes.  Learning  such  information  typically  proceeds 
automatically  and  without  conscious  control.  Hence,  in  many  (perhaps  most) 
learning  situations  it  appears  that  a  good  proportion  of  knowledge  is  acquired  in  an 
unintentional,  non-reflective  manner.  This  type  of  knowledge  acquisition  has  come 
to  be  called  implicit  learning. 

Implicit  learning  is  a  process  whereby  complex  knowledge  of  richly 
structured  stimulus  domains  is,  for  the  most  part,  acquired  independently  of 
awareness  of  both  the  acquisition  process  and  the  knowledge  base  that  is  acquired 
(Winter  and  Reber  1994,  Reber  1967,  1976,  1989,  1993,  Berry  and  Dienes  1993, 
Berry  and  Broadbent  1984,  1988,  Mathews  et  al.  1989,  Mathews  1991, 
Cleeremans  1993).    This  learning  is  not  assumed  to  take  place  in  a  wholly 
incidental  way,  however.  It  is  conceived  as  a  by-product  of  attention  to  relevant 
rule-governed  structures,  be  it  natural  language,  socially  prescribed  behaviors  or  of 
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complex  economic  systems  (Winter  and  Reber  1994).  That  is,  while  the  learner 
explicitly  attends  to  certain  salient  stimuli,  underlying  structured  information  is 
implicitly  induced  and  represented  in  a  tacit  knowledge  base.  Hence,  implicit 
learning  is  a  general  inductive  process  that  extracts  patterned  relationships  from  the 
environment,  creating  a  tacit  knowledge  base. 

This  is  not  to  suggest  that  complex  structures  cannot  be  learned  through 
conscious  analytical  strategies.  The  point  rather  is  that  implicit  operations  can  be 
engaged  in  the  analysis  of  structured  patterns,  even  when  these  regularities  have  not 
been  consciously  detected  (Winter  and  Reber  1994).   Furthermore,  explicit 
processes  are  resource  intensive,  requiring  explicit  memory,  conscious  attention, 
and  overtly  controlled  hypothesis  testing.  On  the  other  hand,  implicit  mechanisms 
are  automatic,  unconscious  and  as  such,  less  resource  consuming,  thus  freeing  up 
explicit  mechanisms  for  higher  order  functions.  Implicit  mechanisms  may  therefore 
be  'better  at'  learning  structures  that  are  too  complex  for  conscious  analytical 
strategies. 

Empirical  Studies 
Implicit  learning  experiments  have  explored  the  process  by  which  people 
acquire  complex  knowledge  about  the  world,  largely  without  conscious  attempts  to 
do  so.  Three  different  implicit  learning  paradigms  have  yielded  consistent  and 
robust  results:  artificial  grammar  learning  (see  Reber  1993  for  a  comprehensive 
review),  process  control  (see  Berry  1993  for  a  comprehensive  review),  and 
sequence  learning  (see  Dienes  and  Berry  1993  for  a  comprehensive  review).  The 
following  review  does  not  attempt  to  address  all  the  complex  issues  that  have  arisen 
through  this  research.  My  goal  is  primarily  to  illustrate  the  kind  of  systems  and 
tasks  that  have  been  shown  to  consistently  engage  implicit  operations,  and  to 
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highlight  some  of  the  issues  that  I  believe  relevant  to  language  and  reading 
acquisition. 

Grammar  Learning 

Artificial  grammar  learning  is  the  most  prominent  and  widely  replicated  of 
the  implicit  learning  paradigms .  Grammar  learning  has  been  used  particularly  to 
examine  the  relationship  between  implicit  learning  and  conscious  awareness.  These 
experiments,  first  explored  by  Reber  (1965,1969),  have  subjects  attend  to  complex 
rule-governed  stimuli  (usually  strings  of  letters  or  geometric  shapes)  generated  by  a 
synthetic,  semantic  free,  Markovian,  finite-state  grammar.  There  is  a  well- 
developed  branch  of  finite  mathematics  that  underlies  these  grammars.  Further,  the 
stimulus  domains  are  novel  and  arbitrary  so  as  to  rule  out  the  effect  of  preexisting 
knowledge.  These  systems  are  too  complex  to  be  learned  by  simple  testing  of 
consciously  held  hypotheses. 
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Fig.  5. 1  The  first  and  simplest  of  the  Markovian  artificial  grammars  used  by  Reber 
( 1965)  with  several  grammatical  strings  that  it  generates  (Reber  1993). 
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The  typical  study  proceeds  in  two  phases.  Subjects  first  memorize  strings 
of  letters  generated  by  the  synthetic  language.  Later  they  are  tested  for  their 
knowledge  of  the  grammar's  rules  by  being  asked  to  make  decisions  concerning  the 
well-formedness  of  novel  strings  of  letters.   Subjects  are  not  informed  that  they  are 
working  with  rule-governed  stimuli  but  are  asked  only  to  memorize  the  strings  of 
letters  printed  on  cards.  Another  group  of  subjects  perform  the  same  task  but  with 
random  strings  of  letters.  Twenty  learning  stimuli  are  used  in  the  acquisition  phase, 
usually  three  to  eight  characters  in  length. 

In  experiments  of  this  type,  there  were  some  interesting  differences  between 
the  subjects  presented  with  the  grammatical  strings  and  those  presented  with 
random  strings.  With  practice,  both  groups  of  subjects  became  increasingly  good 
at  this  task.  However,  subjects  who  worked  with  grammatical  strings  made  an 
average  of  eighteen  errors  before  being  able  to  reproduce  all  four  strings  of  the  first 
set  correctly.  By  the  seventh  set  they  made  an  average  of  three.  Subjects  who 
worked  with  the  random  strings  showed  a  different  pattern.  They  too  began  with 
eightteen  errors  with  the  first  set  but  after  several  sets  make  an  average  of  eight 
errors.  The  improvement  that  occurred  among  both  groups  was  attributed  to  a 
"learning  to  learn"  effect  that  occurs  regardless  of  structure.  The  difference  in 
performance  between  the  two  groups  was  attributed  to  the  first  group's  ability  to 
exploit  the  structure  in  the  stimuli.  Furthermore,  the  subjects  that  confronted  the 
rule-governed  strings  were  then  able  to  use  what  they  had  learned  about  the  rules  of 
the  grammar  in  order  to  discriminate  new  strings  that  conformed  to  the  grammar. 
These  findings  are  very  robust  and  have  been  replicated  many  times  under  diverse 
conditions  (Reber  1965,  1969,1976;  Brooks  1978;  Fried  and  Holyoak  1984; 
Mathews,  Buss,  Stanley,  Blanchar-Feilds,  Cho  and  Druhan  1989;  Morgan  and 
Newport  1981;  and  Servan-Schreiber  and  Anderson  1990). 


80 


Equally  interesting  is  the  observation  that  subjects  transfer  implicitly 
inquired  knowledge  from  one  environment  to  another.  Reber  (1969)  asked  subjects 
to  memorize  a  set  of  strings  generated  from  a  finite  state  grammar.  He  later 
presented  them  with  the  task  of  memorizing  a  set  of  strings  generated  from  the  same 
grammar,  but  with  a  different  set  of  letters,  or  a  different  grammar.  He  found  that 
subjects'  performance  improved  on  the  second  set  of  strings  when  the  set  was 
generated  from  the  same  grammar,  even  when  the  strings  were  made  up  of  different 
letters.  Mathews  et  al.  (1989)  replicated  this  basic  finding.   The  researchers 
concluded  that  subjects  learned  abstract  regularities  of  the  stimulus  network. 

Another  interesting  issue  is  the  interaction  of  explicit  leaning  strategies  with 
implicit  learning.  Reber  (1976)  found  that  encouraging  subjects  to  "look  for  rules" 
during  the  memorization  phase  resulted  in  a  decrease  in  performance  on  the 
grammaticality  test,  indicating  some  kind  of  interference  effect.  Reber  concluded 
that  subjects  were  at  a  disadvantage  when  they  explicitly  searched  for  patterns 
because  the  properties  of  the  stimulus  array  (Markovian  grammar)  were  not  easily 
detected  by  conscious  problem  solving  strategies.  That  is,  the  grammatical  rules 
were  too  complex  to  be  deciphered  by  consciously  held  hypotheses.  However, 
when  the  rules  of  the  stimulus  array  were  made  more  salient  by  manipulating  the 
properties  of  the  material,  Reber,  Kassin,  Lewis  and  Cantor  (1980)  found  that  the 
interference  caused  by  explicit  rule  seeking  disappeared.  They  concluded  that 
looking  for  rules  works  only  if  you  can  find  them. 

Further  experiments  found  that  giving  the  subjects  specific  information 
about  the  stimulus  array  effected  learning.  Reber  et  al.  (1980)  addresses  this  issue 
with  another  grammar  learning  procedure  that  gives  subjects  both  an  explicit 
training  period  and  a  passive  observation  period.  Three  groups  of  subjects  were 
given  a  7-minute  course  using  a  schematic  diagram  of  the  grammar  which  illustrated 
the  rule  system  and  were  instructed  how  the  grammar  generated  strings.  They  also 
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simply  observed  a  set  of  correct  exemplars.  This  explicit  training  was  combined 
with  the  implicit  learning  procedure  (observation  only)  in  various  ways.  All  three 
groups  of  subjects  received  the  explicit  training  session,  but  received  the  training  at 
different  times.  One  group  received  training  at  the  outset;  another  at  midcourse, 
when  some  of  the  exemplars  had  been  seen.  The  last  group  received  training  at  the 
end,  when  all  of  the  exemplars  had  been  seen.  As  usual,  learning  was  tested  by  the 
subjects'  grammaticality  judgments. 

The  authors  found  that  the  earlier  the  explicit  instructions  are  given,  the 
more  effective  they  were.  Reber  suggests  that  providing  explicit  instructions  at  the 
outset,  provided  they  are  correct,  serves  to  focus  and  direct  the  subjects'  attention. 
The  instructions  given  did  not  teach  the  grammar  in  any  complete  way,  but  simply 
oriented  the  subjects  toward  the  relevant  invariances  in  the  stimuli,  so  that  the 
subjects  were  able  to  teach  themselves.  On  the  other  hand,  when  subject  were 
given  explicit  training  during  or  after  the  implicit  observation  period,  the  training 
was  less  likely  to  facilitate  learning.  The  authors  suggested  that  giving  explicit 
information  about  the  system  aftersubjects  have  passively  observed  it  imposed  a 
formalization  of  structure  not  in  concord  with  the  system  that  subjects  were 
inducing.  Moreover,  individual  learners  may  build  representations  that  vary  quite 
widely  from  each  other.  Several  researchers  have  observed  that  there  seems  to  be 
wide  variation  among  individuals  in  regard  to  implicitly  acquired  knowledge  of 
artificial  grammars  (Dulany  et  al.  1986;  Reber  1989,  Mathews  et  al.  1989). 
Holland  et  al.'s  (1986)  competitive  rule  induction  model  predicts  wide  individual 
variability  in  the  initial  features  induced  from  artificial  grammars  because  many 
different  cues  (exemplar  fragments)  could  be  used  to  select  valid  strings.  Once  a  set 
of  cues  become  strengthened  by  a  sufficient  number  of  successful  predictions,  there 
is  no  reason  for  the  learner  to  further  modify  the  knowledge  base.  Consequently, 
the  learners'  representations  are  likely  to  vary  across  individuals.  Likewise,  learner 
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representations  are  likely  to  differ  from  the  formalization  represented  by  the 
schematic  of  the  Markovian  grammar.  Hence,  if  information  about  the  grammar's 
formalization  is  given  after  learners  have  developed  their  own  representations  of  the 
system,  it  causes  interference  and  learning  delay. 

Mathews  et  al.  (1989)  further  investigated  the  interaction  of 
implicit/explicit  processes  on  learning.  They  compared  subjects  performance  with 
two  types  of  artificial  grammars:  one  Markovian  finite-state  grammar  and  one  based 
on  simple  logical  rules.  The  authors  hypothesized  that,  where  structure  is  opaque 
as  in  the  case  of  the  finite  state  grammar,  explicit  learning  strategies  should  not 
facilitate  learning  beyond  what  was  implicitly  induced  .  The  grammar  based  on 
simple  logical  rules  however,  should  be  accessible  to  explicit  processes.  In  order  to 
examine  the  interaction  between  implicit  an  explicit  learning  with  both  types  of 
grammars,  two  types  of  tasks  were  devised.  The  first  was  an  implicit  "match" 
short-term  memory  task.  Subjects  were  presented  with  a  single  valid  string  from 
the  grammar  and  told  to  hold  it  in  memory  for  a  few  seconds  until  five  choices 
appeared  on  the  screen.  They  were  then  asked  to  select  the  identical  string  from 
these  choices,  unaware  that  the  strings  were  rule  generated.  The  second  task  was 
an  explicit  "edit"  task.  Subjects  were  exposed  to  the  same  set  of  items,  generated 
by  the  same  grammar  as  in  the  match  task.  Yet,  one  of  five  letters  in  each  string 
was  incorrect.  Subjects  were  told  that  they  would  see  items  that  were  flawed 
strings  generated  by  a  grammar.  Their  task  was  to  figure  out  the  rules  of  the 
grammar  so  that  they  could  identify  the  incorrect  letter  in  each  string. 

There  were  five  experimental  groups.  One  group  only  performed  the 
implicit  match  task,  another  only  the  explicit  edit  task.  The  other  three  groups 
performed  both  tasks  in  equal  amounts.  Of  these  groups,  one  (match/edit)  did  the 
implicit  match  task  first  on  half  the  trials,  then  the  explicit  edit  task  on  the  other  half. 
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Another,  (edit/match)  did  the  edit  task  first  then  the  match.  The  last  group 
alternated  match  and  edit  trials. 

The  results  using  the  finite-state  grammar  were  that  there  were  no  significant 
differences  among  groups,  though  the  groups  that  did  the  match  task  only  and  the 
match  task  first  performed  slightly  better  than  the  others.    The  authors  concluded 
from  this  that  implicit  induction  is  capable  of  extracting  complex  patterns  without 
conscious  attempts  to  discern  those  patterns,  and  that  explicit  learning  strategies  did 
not  aid  knowledge  acquisition  of  a  finite  state  grammar.  These  findings  support 
earlier  conclusions  (Reber  et  al.  1980)  that  explicit  learning  strategies  do  not  aid 
learning  unless  the  structure  is  sufficiently  obvious  to  find. 

There  were  significant  differences  among  experimental  groups  however, 
when  the  same  experiment  was  done  using  the  simple,  logical  rule  grammar .  The 
match/edit  group  performed  significantly  better  than  all  the  other  groups.  The 
match  only  group  performed  the  worst,  nearly  at  chance  levels.  These  results 
suggest  that  implicit  learning  is  operative  both  in  environments  where  structure  is 
quite  opaque  (finite-state  grammar),  and  where  structure  is  more  salient  (logical 
grammar).  Also,  it  appears  that  when  stimuli  are  obvious  enough  for  explicit 
learning  strategies,  the  interaction  of  implicit  and  explicit  learning  produced  a 
significantly  better  understanding  of  the  system,  indicating  a  "synergistic  effect." 
That  is,  when  implicit  and  explicit  functions  are  allied,  learning  is  more  effective 
than  it  would  otherwise  be.  Moreover,  the  sequential  order  of  implicit  and  explicit 
learning  is  relevant.  The  match/edit  group's  greatly  superior  performance  suggests 
that  a  period  of  implicit  observation  benefits  learning  a  more  salient  system.   That 
is,  for  systems  equivalent  to  the  simple  logical  grammar,  the  optimal  procedure  for 
learning  is  to  develop  an  implicit  knowledge  base  before  beginning  to  generate  an 
explicit  model  of  the  system. 
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Another  relevant  issue  that  has  been  explored  with  the  grammar  learning 
paradigm  is  the  availability  to  consciousness  of  the  implicit  knowledge  base. 
Though  early  research  with  implicit  learning  claimed  a  clear  distinction  between 
implicit  and  explicit  processes,  most  researchers  now  agree  that  implicitly  acquired 
knowledge  is  not  wholly  unconscious,  and  that  various  types  of  tacit  knowledge 
bases  will  differ  in  degree  in  its  availability  to  consciousness.  Reber  and  Lewis 
(1977)  suggest  that  a  tacit  knowledge  base  may  become  available  to  consciousness, 
but  the  ability  to  explicitly  state  knowledge  acquired  from  implicit  learning 
processes  always  lags  behind  what  is  known  unconsciously.  In  a  four-day  study 
subjects  solved  anagram  puzzles  generated  by  an  artificial  grammar.  Subjects  were 
then  asked  to  introspect  and  write  the  rules  of  the  grammar,  or  the  rules  and/or 
strategies  they  were  using  to  solve  the  puzzles.  Over  the  four  days,  there  was  a 
general  increase  in  the  subjects'  ability  to  communicate  their  knowledge  of  the 
system.  Yet,  there  was  also  an  increase  in  ability  to  solve  the  anagrams.  As 
subjects  increased  their  ability  to  verbalize  the  system's  rules,  they  also  acquired  a 
richer  knowledge  of  the  system.  Consequently,  task  performance  was  always 
ahead  of  explication. 

The  Mathews  et  al.  (1989)  study  also  addressed  the  consciousness  issue. 
The  authors  used  a  "teach  aloud"  technique  whereby  subjects  were  asked  during  the 
task  to  stop  and  to  explain  to  another  subject*  what  rules  or  strategies  they  were 
using  to  learn  the  grammar.  This  information  was  given  to  second  control  group  of 
subjects  who  were  tested  in  the  same  well-formedness  task.  With  this  information, 
the  second  group  performed  about  half  as  well  as  the  first  group.  As  the 
experiment  progressed  and  the  proficiency  of  the  first  group  increased,  so  did  that 
of  the  control  group,  yet  those  in  the  control  group  never  caught  up  with  the  first 


Ifn^6?  re,CeiVing  instructions  about  *•  grammar  were  not  present.  Instead  the  subjects 
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group.  The  authors  concluded  that  implicit  knowledge  of  the  artificial  grammar  is 
available  to  consciousness,  though  incompletely.  Moreover,  the  ability  to  examine 
and  report  the  knowledge  acquired  improves  with  practice,  but  more  slowly  than 
the  acquisition  process. 

A  final  issue  involving  an  important  aspect  of  the  teach  aloud  procedure  was 
the  type  of  information  subjects  communicated.  Virtually  all  subjects  expressed 
their  knowledge  in  terms  of  instructions  to  select  or  avoid  certain  letter  groups. 
For  example,  "select  strings  that  begin  with  SCP"  or  "select  strings  that  end  in 
VV."  Successive  verbal  reports  over  trials  included  elaborations  and  exceptions  to 
"rules"  provided  on  earlier  trial  blocks.  This  information  included  both  item 
specific  and  more  general  features  of  the  grammar.  Mathews  et  al.  (1989) 
suggested  that  subjects  select  patterns  of  common  features  (groups  of  letters)  shared 
by  many  exemplars.   This  is  the  type  of  acquired  knowledge  predicted  by  Holland 
et  al.'s  (1986)  rule  induction  model  and  is  consistent  with  the  idea  that  learners 
build  a  body  of  knowledge  by  encoding  small  subsequences  of  strings  (Reber  and 
Lewis  1977  and  Servan-Schriber  and  Anderson  1990).  For  instance,  it  is 
hypothesized  that,  given  a  string  TTXVPXVS  to  be  committed  to  memory,  subjects 
may  first  create  string  "fragments"  or  "chunks"  TTX  VP  XVS.  These  chunks  both 
facilitate  an  constrain  further  learning.  For  example,  the  following  string: 
VXVPXXXVS  is  likely  to  be  perceived  in  terms  of  previously  analyzed  chunks  VX 
(VP)  XX  (XVS).  Hence,  the  new  string  seems  less  complex  and  constrains  further 
chunking. 

Grammar  learning  and  natural  language  learninp    Nations  and  McLaughlin 
( 1986)  use  an  artificial  grammar  learning  paradigm  in  order  to  suggest  an 
information  processing  approach  to  language  learning.  They  proceed  from  the 
assumption  that  mastery  of  complex  tasks  involve  the  integration  of  controlled 
processing  -  which  is  resource  intensive,  requiring  a  large  amount  of  processing 
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capacity,  and  automatic  processing  --  which  is  low  demand  and  requires  little 
processing  energy27.  They  contrast  three  groups'  performance  in  a  grammar 
learning  task:  monolinguals,  bilinguals  and  multilinguals.  All  subjects  participated 
in  two  phases  of  the  experiment.   The  first  phase  consisted  of  an  implicit  learning 
task,  which  required  subjects  to  attend  to  the  stimuli  without  instruction  as  to  what 
they  should  learn,  while  the  second  phase  involved  an  explicit  task  whereby 
subjects  were  informed  that  the  stimuli  followed  rules  and  were  instructed  to  try  to 
discover  them.  After  each  phase  subjects  were  asked  make  grammaticality 
judgments  of  exemplar  strings.  The  results  were  that  multilinguals  performed 
significantly  better  on  the  implicit  learning  task  than  either  of  the  other  groups  and 
that  there  were  no  differences  between  groups  on  the  explicit  task. 

The  authors  hypothesize  that  strategies  used  by  "expert"  learners 
(multilinguals)  differ  sharply  from  "novice"  (monolinguals  and  bilinguals)  learners. 
Basically,  the  contention  is  that  multilinguals  are  successful  with  the  implicit  task 
because  they  have  automated  the  basic  strategies  of  pattern  recognition,  while  the 
novice  learner  groups  have  not.  Relevant  to  the  greater  discussion  is  that 
multilinguals  superior  performance  suggests  that  pattern  recognition  is  a  significant 
part  of  the  language  learning  task.  Moreover,  expert  learners  tend  to  use  implicit 
learning  strategies  in  the  proper  environments,  which  allow  them  to  allocate  their 
cognitive  resources  more  efficiently  than  novices. 


The  automatic/controlled  distinction  is  generally  used  to  account  for  the  progression  from  a 
slower,  more  cogniuvely  demanding  stage  to  an  automated  stage  of  performance.  Automatic 
processing  occurs  when  a  particular  response  has  been  built  up  by  processing  the  same  input  over 
many  trials.  Controlled  processing  is  deliberate,  temporary  activation  of  memory  though 
attentional  control  (Schneider  and  Shiffrin  1977;  Anderson  1983;  McLaughlin  1990)  While  the 
authors  use  an  implicit  learning  paradigm  they  do  no  specify  how  exacdy  the  automatic/controlled 
distinction  correlates  with  the  implicit/explicit  distinction  or  specify  the  differences  between  them 
1  he  relationship  between  implicit/  explicit  and  automatic/  controlled  is  not  necessarily  equivalent 
smce  explicitly  learned  matenal  can  become  automated.  Implicit  learning,  also,  becomes  more 
automated  with  practice.  However,  I  believe  that  the  results  of  this  experiment  contnbutes  to  (he 
empirical  data  on  implicit  learning  and  bears  on  the  overall  discussion. 
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Process  Control 

Other  evidence  that  demonstrates  implicit  learning  comes  from  process 
control  studies  which  have  found  that  people  are  able  to  control  complex  systems 
even  when  they  lack  the  ability  to  verbalize  the  knowledge  required  to  perform  such 
tasks.  Berry  and  Broadbent  ( 1984)  looked  at  the  effects  of  the  learning  task,  verbal 
instruction  and  subject's  ability  to  verbalize  their  knowledge.  Subjects  were 
required  to  control  the  output  of  a  simulated  system,  a  sugar  production  plant  for 
example,  by  setting  the  value  of  one  or  more  input  variables.  In  the  role  of  manager 
of  a  sugar  production  plant,  subjects  were  asked  to  modify  production  to  achieve 
some  preassigned  level.  The  control  was  implemented  by  adjusting  production 
variables  such  as  wages,  labor  peace,  and  number  of  workers  hired.  The  computer 
was  programmed  with  rules  that  related  production  levels  to  these  factors.  The 
subjects'  task  was  to  make  shifts  in  the  value  of  the  relevant  variable  in  an  attempt 
to  achieve  the  target  production  levels.  The  computer  interacted  with  the  subjects 
by  displaying  production  levels  in  response  to  their  adjustments.  Following  the 
experience  subjects  were  required  to  complete  a  written  questionnaire  that  asked 
about  the  relationships  within  the  system. 

The  results  were  that  task  performance  improved  significantly  with  practice 
but  performance  on  post-task  questions  did  not  improve.  In  fact,  subjects'  post- 
task  responses  were  no  better  than  compared  to  subjects  with  no  experience  at  all. 
Conversely,  explicit  verbal  instruction  on  how  to  reach  and  maintain  the  target  value 
significantly  improved  ability  to  answer  questions,  but  had  no  effect  on  task 
performance.  However,  explicit  instructions  did  have  a  beneficial  effect  on  task 
performance  when  subjects  were  also  required  to  verbalize  a  reason  for  each  control 
value  they  chose.  Overall,  however,  there  was  no  evidence  of  a  positive 
association  between  task  performance  and  question  answering.  Also,  across  three 
experiments  with  similar  types  of  tasks,  subjects  who  were  better  at  controlling  the 
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task  were  significantly  worse  at  answering  questions.  Berry  and  Broadbent 
concluded  that  it  was  likely  that  these  type  of  tasks  were  learned  implicitly. 

Broadbent,  Fitzgerald  and  Broadbent  (1986)  replicated  these  findings 
using  a  city  transport  system  control  task  and  an  economic  model  task.  Again 
results  showed  that  task  performance  improved  with  practice  but  the  same  did  not 
hold  with  respect  to  answering  questions  about  the  task.  Other  studies  (Hayes  and 
Broadbent  1988)  indicated  that  inducing  subjects  to  use  explicit  control  strategies 
(like  Reber's  "looking  for  rules"  in  an  artificial  grammar)  resulted  in  a  decrease  in 
task  performance.  This  suggests  that,  like  Markovian  grammar  learning,  when  the 
system  regularities  are  sufficiently  opaque,  explicit  learning  strategies  do  not  help. 
Likewise,  Berry  and  Broadbent  ( 1987,1988)  found  that  when  they  manipulated  the 
stimuli  such  that  it  became  more  salient,  subjects  were  able  to  answer  explicit 
questions  about  the  material  and  performance  was  correlated  with  explicit 
knowledge. 

These  results  are  strikingly  similar  to  the  results  of  the  grammar  learning 
task.  It  appears  that  looking  for  structure  only  helps  if  structure  is  easy  to  find . 
Similarly,  it  appears  that  whatever  type  knowledge  base  is  acquired,  it  is  difficult  to 
verbalize.  Stanley,  Mathews,  Buss  and  Kotler-Cope  (1989)  showed  that,  in  a  task 
which  asked  one  group  of  subjects  to  explain  the  system  to  other  subjects,  the  first 
group  of  subjects  attained  high  levels  of  performance  long  before  they  were  able  to 
communicate  that  knowledge.  Moreover,  typically  in  process  control  studies, 
subjects  report  that  they  made  certain  decisions  because  they  "feel  right"  or  may 
simply  believe  that  they  are  guessing  (Berry  and  Dienes  1993). 

Sequence  Pattern  Acquisition 

Sequence  pattern  acquisition  involves  three  different  types  of  tasks:  choice 
reaction  time  tasks,  prediction  tasks  and  probability  learning  tasks.  The  rationale 


underlying  the  sequence  learning  paradigm  is  that  responses  reflect  sensitivity  to 
sequential  structure  present  in  the  material  (Cleeremans  1993).  In  a  choice  reaction 
task  (Lewicki,  Hill  and  Bizot  1988)  subjects  were  presented  with  a  complex  matrix 
of  numbers  arranged  in  four  quadrants  on  a  computer  screen.  The  task  was  to  note 
the  location  of  the  target  number  by  quickly  pressing  a  button  indicating  the  proper 
quadrant.  A  continuous  sequence  of  target  numbers  appeared  in  one  of  the  four 
quadrants.  The  sequential  structure  of  the  material  was  manipulated  to  generate 
sequences  of  five  elements  according  to  a  set  of  simple  rules.  Each  rule  defined 
where  the  next  stimulus  appear  (e.g.,  which  of  four  quadrants  of  the  computer 
screen).  Locations  were  a  function  of  the  position  of  the  two  previous  stimuli.  The 
first  two  elements  of  each  sequence  was  unpredictable,  while  the  last  three  were 
determined  by  their  predecessors.  The  dependent  measure  was  reaction  time. 
Subjects  were  unaware  that  the  sequence  of  successive  trials  contained  regularities. 
With  practice  subjects  whose  trial  sequence  was  structured  showed  a  decrease  in 
reaction  time  that  was  significantly  greater  than  subjects  who  observed  material  with 
no  sequential  structure.  Also,  subjects  were  unable  to  articulate  the  rules  they 
appeared  to  be  using. 

Prediction  tasks  use  essentially  the  same  design,  but  instead  of  reacting  to 
the  current  stimulus,  subjects  are  asked  to  predict  the  next  stimulus  or  where  it  will 
appear .  In  this  case  the  dependent  measure  is  the  percentage  of  correct  predictions 
over  trials.  The  usual  result  is  that  subject's  ability  to  predict  the  location  of  the 
next  trial  improves  with  practice,  yet  they  are  unable  to  articulate  how  they  came  to 
their  decision  (Cleeremans  1993). 

Experiments  using  the  probability  learning  task,  introduced  by  Millward  and 
Reber  ( 1968,  1972),  asked  subjects  to  observe  a  sequence  of  events  and  predict 
what  comes  next  in  the  sequence.  Subjects  anticipated  the  changing  probabilities  of 
events  even  when  the  anticipatory  response  required  and  integration  of  information 
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across  50  preceding  events  (Reber  1993).  The  experiment  consisted  basically  of 
subject's  watching  two  lights  rapidly  flashing  on  and  off  in  particular  patterns. 
Subjects  were  given  a  thousand  trials  at  the  rate  of  two  per  second  and  were  asked 
to  predict  the  successive  events  before  they  occurred.  The  probability  of  any  event 
was  systematically  increased  and  decreases  as  they  moved  through  the  trials. 
Subjects  learned  to  anticipate  shifts  in  the  likelihood  of  events  but  their  performance 
was  random  when  asked  to  recall  crucial  trials  by  which  they  formed  their 
decisions.  The  findings  from  sequence  pattern  acquisition  experiments  are  similar 
to  that  of  the  other  learning  paradigms  in  the  following  respects.    Subjects  appear 
to  be  exploiting  the  structure  of  the  stimuli  in  order  to  perform  the  task.  Learning  is 
gradual  and  improves  over  many  trials.  Subjects  are  not  easily  able  to  identify  how 
they  made  their  decisions  or  articulate  the  "rules"  they  seem  to  be  using. 

Learning  Mechanisms  and 
Knowledge  Representation 

Empirical  research  has  led  to  much  speculation  about  the  mechanisms  by 

which  implicit  learning  occurs.  In  fact,  there  is  much  debate  in  the  literature  as  to 

the  nature  of  implicitly  acquired  knowledge.  Reber's  early  definition  of  implicit 

processes  gave  them  the  general  characteristics  of  producing  an  abstract,  tacit 

knowledge  base  that  is  representative  of  a  structured  environment.  He  further 

suggested  that,  implicit  learning  optimally  occurs  independently  of  conscious 

attempts  to  learn.   Ongoing  research  has  found  that  this  view  to  be  an 

oversimplification.  The  idea  that  implicit  learning  produces  an  abstract  and 

unconscious  knowledge  base  has  been  particularly  disputed.   This  is  largely 

because  of  the  difficulty  of  demonstrating  (or  defining)  the  notions  of 

"abstractness"  and  "consciousness".  In  general,  implicitly  acquired  knowledge  has 

been  consistently  shown  to  be  resistant  to  conscious  inspection.  Yet,  empirical 

research  has  indicated  that  depending  on  the  system  being  learned  and  the  type  of 
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task  employed  in  learning,  the  knowledge  base  will  vary  in  its  availability  to 
conscious  awareness.  That  is,  conscious  awareness  is  a  gradient  property  of 
implicit  knowledge. 

Abstractness  also  appears  to  be  a  matter  of  degree.  Early  research 
characterized  implicit  knowledge  as  abstract  rule-based  knowledge.  What  was 
generally  meant  by  abstraction  was  that  the  implicit  knowledge  is  not  about  specific 
exemplars  or  based  on  relevant  features  of  these  exemplars,  but  instead  that  the 
knowledge  is  rule-like.  However,  there  is  abundant  evidence  that  what  subjects 
acquire  is  not  an  abstract  rule-governed  schematic  (like  the  actual  schematic  of  the 
Markovian  grammar),  but  fragmentary  knowledge  about  local  constraints.  Hence, 
the  acquired  knowledge  is  not  abstract  or  rule-governed  in  this  early  sense.  Later 
rule-based  theories  specify  relationships  between  exemplar  fragments,  however. 
This  network  of  local  relationships  are  thought  to  be  abstract  in  nature.   For 
example,  Reberand  Lewis  (1977)  suggested  that  bigram  or  trigram  invariance  rules 
are  used  to  select  valid  strings.  Holland  et  al.  ( 1986)  posit  a  competitive  rule 
induction  system  in  which  local  (fragmentary)  features  are  abstracted  from 
exemplars.  The  cues  identified  become  strengthened  with  each  successful 
prediction.  Implicit  learning  processes  continually  modify  the  strength  of 
competing  rules. 

Some  researchers  adopt  the  view  that  implicit  learning  of  exemplar 
fragments  involves  a  memory  association-based  system  where  the  resulting 
knowledge  base  need  not  be  abstract  at  all.  As  described  above,  learning  involves 
"chunking"  short  groups  of  elements  to  form  a  network  of  local  relationships, 
however,  grammatical  knowledge  is  tied  to  a  specific  letter  set  (Servan-Schreiber 
and  Anderson  1990).  In  fact,  most  researchers  now  acknowledge  that  encoding 
small  substrings  is  central  to  the  grammar  learning  task  (Dulany  et  al.  1984; 
Perruchet  and  Pacteau  1990;  Reber  and  Lewis  1977;  Holland  et  al.  1986;  Mathews 
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et  al.  1989;  Reber  1993;  Cleeremans  1993;  Winter  and  Reber  1994).   This  is  true 
for  rule  induction  models  as  well  as  memory  based  models.    The  debate  now  turns 
on  the  issue  of  whether  the  resultant  network  of  chunks  becomes  an  abstract 
knowledge  base  (Reber  1989,1993;  Reber  and  Lewis  1977;  Mathews  et  al.  1989), 
or  whether  grammatical  knowledge  is  tied  to  a  specific  letter  set  (Servan-Schriber 
and  Anderson  1990;  Perruchet  and  Pacteau  1990).   The  most  convincing  evidence 
that  tacit  knowledge  is  abstract  is  subjects'  ability  to  transfer  their  grammar 
knowledge  to  letter  strings  they  have  never  seen  before.  This  effect  has  been 
replicated  many  times.   Mathews  et  al.  (1989)  suggest  that  while  grammar  learning 
relies  on  association-based  chunking,  the  resultant  knowledge,  a  network  of 
chunks,  is  abstract.  Moreover,  this  combination  of  item  specific  and  general- 
abstract  information  is  the  type  of  acquired  knowledge  the  Holland  et  al.  ( 1986)  rule 
induction  model  predicts. 

The  issue  of  rule-based  versus  association-based  implicit  learning 
mechanisms  has  been  very  difficult  to  sort  out  empirically.  Indeed,  the  mental 
representation  of  structure  may  fall  between  storage  of  specific  instances  in 
memory,  to  more  general  abstract  descriptions.    In  fact,  systems  based  on  memory 
for  specific  exemplars  and  systems  based  on  abstraction  of  rules  may  be 
functionally  equivalent  (Cleeremans  1993).  That  is,  although  these  respective 
models  make  different  predictions  about  knowledge  representation,  there  is 
considerable  overlap  in  terms  of  their  simulating  human  performance.   While  this 
continues  to  be  a  theoretical  point  of  contention,  most  researchers  agree  that  implicit 
learning  tends  toward  the  following  characteristics: 

•  Implicitly  acquired  knowledge  is  fragmented,  consisting  of  many  units  (e.g., 
"chunks"  or  "fragments")  that  encode  local  information  (Dulany  et  al.  1984; 
Perruchet  and  Pacteau  1990;  Reber  and  Lewis  1977;  Holland  et  al.  1986- 
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Mathews  etal.  1989;  Servan-Schriber  and  Anderson  1990;  Reber  1993; 
Cleeremans  1993;  Winter  and  Reber  1994). 
•  Implicit  learning  and  the  implicit  knowledge  base  are  strongly  constrained  by  the 
learning  task  and  subjects'  (cognitive)  ability  to  perform  that  task.  For 
example,  learning  the  finite-state  grammar  will  be  constrained  by  whether  the 
task  is  to  memorize  the  strings  or  to  figure  out  the  system  of  rules  that  generate 
the  strings.  The  various  experimental  paradigms  reviewed  differ  greatly  in  the 
nature  of  the  task  and  hence  will  point  to  different  constraints  on  processing 
(Reber  et  al.  1980;  Berry  and  Broadbent  1989;  Hayes  and  Broadbent  1988; 
Mathews  etal.  1989). 

•  Processing  is  unselective  (Berry  and  Broadbent  1988).  Implicit  learning 

strategies  allow  the  system  to  remain  sensitive  to  all  dimensions  of  the  stimulus, 
at  the  cost  of  slow,  gradual  learning.  By  contrast,  explicit  learning  selects 
specific  dimensions  of  the  stimulus  which  (if  correct)  results  in  faster  learning. 

•  Implicit  learning  proceeds  automatically,  unintentionally  and  is  resistant  to 

conscious  inspection.  However  it  can  be  used  to  guide  subsequent  task  related 
judgments  and  decisions  (Berry  and  Broadbent  1988;  Reber  1989;  Berry  and 
Dienes  1993;  Winter  and  Reber  1994) 

•  For  most  learning  situations,  both  implicit  and  explicit  mechanisms  operate 

simultaneously.  1  will  elaborate  on  this  point  in  the  next  section. 

Implicit  versus  Explicit  Learning  Mechanisms 

Although  implicit  learning  experiments  attempt  to  isolate  implicit  processes, 
the  literature  suggests  that  both  implicit  and  explicit  mechanisms  are  involved  in 
learning  complex  systems.  Indeed,  many  of  the  tasks  (in  this  review)  used  to  elicit 
implicit  processes  involve  both  explicit  and  implicit  learning.   Furthermore,  implicit 
and  explicit  learning  involve  processes  which  may  vary  in  their  accessibility  to 
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conscious  awareness,  degree  of  generality,  and  degree  to  which  these  processes 
facilitate  each  other.  The  relevant  question  then  becomes,  what  factors  are  likely  to 
influence  the  implicit/explicit  learning  interface  for  a  particular  task  and  a  particular 
system  to  be  learned? 

Cleeremans  (1993),  following  Holland  etal.  (1986)  suggests  that  particular 
systems  and  particular  tasks  will  elicit  learning  strategies  that  range  from  purely 
implicit  to  explicit  depending  on  "regularity  salience"  and  "task  demands. "  The 
regularity  salience  of  a  particular  system  depends  on  a  mixture  of  the  following:  1 ) 
The  number  of  cues  and  features  present  in  the  stimuli.  Implicit  learning  is  active 
typically  when  the  number  of  cues  and  features  present  in  the  system  exceeds  the 
learner's  ability  to  attend  to  them.  Where  a  system's  features  are  restricted  enough, 
learners  are  likely  to  engage  explicit  problem  solving  techniques.  2)  The  amount  of 
external  feedback  about  the  specific  relevance  of  features  and  cues.  In  implicit 
learning  situations,  external  feedback  is  usually  minimal.  Explicit  learning  is 
typically  guided  by  feedback.  3)  The  validity  of  cues.  Implicit  learning  situations 
are  typically  noisy  environments,  where  cues  are  only  partially  valid  and  have  many 
exceptions.  Systems  where  cues  are  valid,  complete  and  obvious  are  likely  to 
engage  explicit  reasoning  strategies. 

A  learning  situation  that  has  a  very  wide  or  diverse  number  of  cues,  little 
external  feedback,  and  presents  learners  with  noisy  environments  would  have  low 
regularity  salience.  Indeed,  research  has  indicated  that  the  lower  the  regularity 
salience,  the  more  likely  learners  will  engage  in  "pure"  implicit  learning,  and  the 
less  likely  they  are  to  explicitly  seek  regularities.  On  the  other  hand,  the  higher  the 
regularity  salience,  the  easier  regularities  are  to  find,  the  more  likely  learners  will 
engage  in  explicit  problem-solving.     The  demands  of  a  particular  task  will  interact 
with  regularity  salience  and  influence  learning.  The  higher  the  task  demands,  the 
more  likely  learners  will  attempt  to  use  explicit  strategies.  For  example,  if  learners 


are  instructed  to  look  for  structure  in  the  material,  the  demands  of  the  task  increase, 
and  they  will  attempt  explicit  code  breaking  techniques.  In  contrast,  instructions  to 
memorize  a  series  of  exemplars  over  many  trials  is  a  lower-demand  task.  For 
example,  Mathews  al.'s  (1989)  "match  task"  required  subjects  to  remember  an 
exemplar  and  select  an  identical  string  from  five  choices.  The  demand  of  this  match 
task  is  a  lower  than  the  "edit  task,"  which  required  the  subject  figure  out  the  rules  of 
the  grammar  in  order  to  identify  the  errors  in  the  exemplars. 

Most  learning  situations  will  engage  a  mixture  of  implicit  and  explicit 
learning.  Cleeremans  (1993)  suggests  that  learning  situations  can  be  characterized 
by  regularity  salience  and  task  demands  that  are  gradient  from  low  to  high. 
Paradigms  like  sequence-learning,  which  are  characterized  by  low  regularity 
salience  and  low  task  demand,  are  most  likely  to  result  in  successful,  purely  implicit 
learning.   Tasks  characterized  by  high  regularity  salience  and  task  demands,  like 
problem  solving,  are  likely  to  result  in  successful  explicit  learning.  Many  other 
learning  situations  fall  at  different  points  between  and  will  engage  a  mixture  of 
implicit  and  explicit  processes**.   This  issue  is  particularly  relevant  to  the  overall 
discussion.  I  suggest  that  learning  to  process  written  language  involves  the  type  of 
tasks  (decoding,  comprehension,  interpretation)  and  the  type  of  system  (an 
orthography)  that  is  likely  to  engage  both  implicit  and  explicit  processes.  I  will 
discuss  this  point  in  detail  in  chapter  6. 

Implicit  and  Explicit  Learning  and  Individual  Differences 
Reber  (1989)  hypothesizes  that  implicit  learning  functions  are  a 
phylogenetically  primitive  system  that  emerged  before  the  development  of 
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Cleeremans  (1993: 18)  dluslrates  regularity  salience  and  task  demands  as  two  dimensions  of  the 
implicit/explicit  interaction.  He  classifies  all  learning  paradigms  reviewed  based  on  these 
dimensions.     I  believe  that  most  natural  learning  situations  (like  reading)  will  involve  an 
interface  of  implicit  and  explicit  learning  and  can  similarly  be  characterized  along  these 
dimensions.  B 
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conscious  functions.  That  is,  implicit  functions  are  older,  evolutionary  speaking. 
This  has  a  number  of  implications  concerning  the  interface  of  the  implicit  and 
explicit.  Among  these  is  the  notion  that,  following  the  principle  of  evolutionary 
biology  that  older  functions  will  be  displayed  in  common  across  a  species,  implicit 
cognitive  functions  should  show  less  individual-to-individual  variability  than 
explicit  functions. 

Reber,  Walkenfeld  and  Hernstadt  (1991)  explored  this  premise.   The 
authors  observed  a  group  of  subjects  who  worked  with  two  different  tasks.  One 
group  performed  an  implicit  grammar  learning  task,  while  the  other  performed  an 
explicit  problem  solving  task.  As  predicted,  there  were  substantial  individual 
differences  between  the  subjects  on  the  problem  solving  task.  However,  there  were 
relatively  small  differences  among  subjects  with  the  grammar  learning  task. 
Another  specific  prediction  of  this  principle  would  be  that  a  measure  of  explicit 
cognitive  functions  such  as  standard  IQ  tests"  should  correlate  with  performance 
on  the  explicit  task  but  not  the  implicit  one.  Indeed,  the  authors  found  that 
performance  on  the  explicit  task  strongly  correlated  with  IQ,  but  performance  on  the 
implicit  task  did  not. 

Developmental  Issues 
Another  implication  of  the  premise  that  implicit  functions  are  evolutionarily 
older  than  explicit  functions  is  that  implicit  processes  are  largely  age  independent. 
That  is,  implicit  learning  should  show  fewer  effects  of  age  and  developmental  level 
than  explicit  learning.  Interestingly,  the  issue  of  implicit  learning  and  human 
development  has  been  largely  overlooked.  The  type  of  empirical  data  that 
documents  adult  implicit  learning  is  scant  with  respect  to  children,  although  some 


Leaving  aside  the  debate  surrounding  the  definition  of  "mtelligence",  IQ  score  is  taken  to  he  a 
measure  of  overt,  academic-type  intelhgence  as  opposed  to  the  more  covrMyStoncI  found 
in  natural  environments  (Reber  et  al.  1991).  s™*     """ 
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research  has  addressed  this  issue.  Most  directly  similar  to  the  adult  experiments  are 
unpublished  studies  that  have  been  carried  out  by  Reber  and  colleagues.  Roter 
(1985  cited  in  Reber  1993)  used  a  modified  form  of  the  grammar  learning  task  with 
children  of  average  ages  of  4,  8,  14  and  a  group  of  college  age  adults.  The 
youngest  children  in  the  study  learned  the  grammar  and  showed  no  differences  in 
behavior  from  the  other  subjects.  In  another  study,  children  between  the  ages  of  3 
and  5  were  engaged  in  a  task  where  a  cartoon  character  named  "Max"  appeared  in 
several  marked  locations  on  a  computer  screen.  Their  task  was  to  "catch  Max"  by 
pressing  a  button  that  corresponded  to  his  location.  On  some  days  Max's  location 
was  determined  by  a  complex  set  of  rules  and  on  others  it  was  random.  Over  a  four 
day  period,  the  children  improved  their  response  times  to  Max's  location  when 
Max's  appearances  were  rule-governed.  Response  times  showed  no  improvement 
when  Max's  location  was  random.  The  conclusion  from  these  studies  was  that 
very  young  children,  like  adults,  are  able  to  implicitly  exploit  the  structure  in  their 
environments. 

Another  body  of  research  with  young  infants  (Haith,  Hazan  and 
Goodmanl988  in  Reber  1993)  suggests  that  young  children  and  infants  pick  up 
covariations  in  their  environment  and  exploit  them  behaviorally.  Infants  as  young 
as  three  months  have  been  shown  to  discern  alternating  patterns  in  a  stimulus 
display  and  anticipate  upcoming  events.  In  these  studies  infants  are  presented  with 
a  display  where  a  face  alternates  from  side  to  side.  Initially,  the  infants  follow  the 
appearance  of  each  face;  yet,  after  observing  several  alternations  they  begin  to  make 
anUcipatory  eye  movements  to  the  location  of  the  face's  next  appearance. 

Research  in  this  area  has  clearly  been  lacking.  Perhaps  because  of  the 
(implicit)  assumption  that  virtually  everything  that  young  children  learn  about  their 
social,  physical  and  linguistic  environments  proceeds  without  intentional  learning 
strategies,  there  has  been  little  interest  in  the  implicit  processes  among  children. 
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Neither  has  the  question  of  the  interaction  between  implicit  and  explicit  processes 
been  the  subject  of  much  research.  This  is  particularly  interesting  since  our  formal 
education  proceeds  from  the  assumption  that  learning  complex  knowledge  is  an 
explicit,  conscious  process.  At  the  same  time  educators  rely  on  the  fact  that 
children  bring  crucial  implicitly  acquired  knowledge  to  the  educational  stage. 

The  disparity  between  these  concurrent  assumptions  is  very  likely  to  result 
in  instructional  issues  (both  practical  and  theoretical)  that  come  from  the  failure  to 
recognize  that  two  different  types  of  learning  processes  are  at  work.  Many  issues 
of  reading  acquisition  and  reading  instruction  fall  squarely  in  this  domain. 
Learning  written  language  is  perhaps  the  first  complex  task  that  children  are 
presented  with  that  requires  them  to  bring  together  their  implicit  knowledge  with 
explicit  learning  techniques.  I  will  address  this  point  in  the  following  chapters. 

Some  General  Implications  for  Instruction 
Because  so  many  learning  situations  involve  both  implicit  and  explicit 
learning,  empirical  research  in  this  area  has  practical  import  for  issues  of 
instruction.  The  following  are  some  very  general  implications  of  this  research  so 
far  In  general,  the  more  obvious  or  salient  the  underlying  structure,  the  more 
useful  explicit  learning  and  explicit  training  becomes.  Training  does  not  necessarily 
lead  to  improved  performance  where  tasks  have  relatively  complex,  non-obvious 
underlying  structures. 

Where  the  underlying  structure  is  fairly  salient  (as  in  Mathews  et  al.'s  1989 
simple  artificial  grammar)  a  period  of  passive  observation  before  explicitly  engaging 
the  task  of  figuring  out  the  system  seems  to  greatly  facilitate  learning.  On  the  other 
hand,  where  underlying  structure  is  non-obvious  (as  in  a  Markovian  grammar), 
explicit  training  is  likely  to  be  disruptive  unless  presented  before  learners  begin  to 
form  their  own  implicit  representations.  That  is,  where  explicit  training  is  at  odds 
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with  the  system  the  learner  is  implicitly  inducing,  it  is  disruptive  to  learning.  Also, 
since  learners  (implicitly  acquired)  internal  representations  tend  to  vary  across 
individuals,  any  instructions  regarding  the  rule  system  are  likely  to  be  differ  from 
the  system  learners  have  begun  to  induce. 

The  effect  of  attempting  to  verbalize  what  has  been  learned  implicitly  seems 
to  have  a  positive  effect  on  performance.  Berry  and  Broadbent  ( 1984, 1990)  for 
example,  found  that  following  training,  people  who  were  asked  to  verbalize  during 
trials  how  they  were  controlling  the  system  performed  better  than  those  who  were 
provided  training  alone  or  those  who  were  asked  to  verbalize  alone. 

In  general,  people  are  much  more  likely  to  learn  if  they  are  actively  involved 
in  a  task,  as  opposed  to  watching  another  person  carry  out  the  task.  This  is 
particularly  relevant  to  implicit  learning.  The  less  obvious  the  structure,  the  more  it 
is  necessary  for  people  to  interact  directly  with  the  task  in  order  for  learning  to 
occur  (Berry  1991).  The  more  obvious  the  structure,  the  more  likely  people  are  to 
successfully  learn  by  watching,  rather  than  by  doing. 

Concluding  Remarks 
This  discussion  suggests  that  learning  is  the  result  of  the  operation  of  two 
cooperative,  complementary  systems:  implicit  and  explicit.  In  the  most  basic 
sense,  implicit  learning  refers  to  the  human  ability  to  extract  information  from  an 
environment  without  the  explicit  intention  of  doing  so,  and  without  being  (wholly) 
aware  of  the  resultant  knowledge  acquired.   Explicit  learning  refers  to  intentional 
processes  within  awareness  that  typically  engage  hypothesis  formation  and  testing, 
and  conscious  rule  application.  In  most  learning  situations,  these  functions  operate 
on  the  environment  simultaneously,  processing  different  types  of  information  in 
parallel.  That  is,  implicit  and  explicit  functions  operate  concurrently  on  the  type  of 
environmental  stimuli  that  they  are  'good  at'  processing.  The  resulting  knowledge 
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is  a  synthesis  of  implicit  and  explicit  analysis.  The  degree  to  which  the  learning 
process  and  the  knowledge  base  are  implicit/explicit  will  vary  as  a  function  of  what 
is  being  learned  and  what  task  is  employed  to  learn  it 

The  formal  educational  system  is  in  the  business  of  attending  to,  fostering 
and  promoting  explicit-type  learning  and  knowledge.  While  formal  education  relies 
on  implicitly  acquired  knowledge,  implicit  learning  processes  have  generally  not 
been  a  subject  of  pedagogical  concern.  As  previously  mentioned,  the  failure  to 
recognize  that  two  different  types  of  learning  processes  are  at  work  can  lead  to 
practical  difficulties.  I  contend  that  reading  acquisition  is  a  principal  example  of  a 
complex  learning  situation  where  implicit  and  explicit  processes  merge,  and  also 
where  educators  are  at  a  loss  to  (explicitly)  recognize  this. 

I  suggest  that  learning  to  process  written  language  involves  the  type  of  tasks 
(decoding,  comprehension,  interpretation)  and  type  of  system  (an  orthography)  that 
is  likely  to  engage  both  implicit  and  explicit  processes.  Furthermore,  written 
language  is  a  representation  of  spoken  language,  the  knowledge  base  for  which  is 
largely  implicit.  The  following  chapter  will  bring  together  the  previous  discussions 
of  reading  theory,  writing  systems  and  implicit  learning  in  order  to  illustrate  why  I 
believe  it  is  likely  that  reading  acquisition  involves  implicit  learning. 


CHAPTER 6 

IMPLICIT  LEARNING  AND  READING  ACQUISITION: 

PROCESSING  COMMONALTIES 


In  principle,  insight  into  the  interaction  between  explicit  and  implicit 
learning  operations  permits  a  reconceptualization  of  the  process  of  reading 
acquisition.  What  we  know  about  written  language  processing  and  the 
resemblances  among  writing  systems,  provides  a  framework  for  viewing  reading 
acquisition  as  an  interaction  between  implicit  and  explicit  learning  systems. 
Reading  acquisition  is  often  characterized  as  a  process  that  requires  explicit, 
conscious  analysis  of  how  the  written  code  maps  on  to  spoken  language,  and  that 
this  crucially  distinguishes  it  from  learning  spoken  language.  While  learning  to 
read  arguably  is  involves  more  explicit  processing  than  learning  to  speak  a  first 
language,  I  believe  there  is  considerable  evidence  that  much  of  learning  to  decode 
an  orthography  proceeds  implicitly. 

Drawing  on  the  specifics  of  prior  chapters,  I  will  argue  that  many  of  the 
circumstances  of  learning  to  read  parallel  those  of  an  implicit  learning  situation. 
More  specifically,  reading  acquisition  and  implicit  learning  have  the  following 
characteristics  in  common,  each  of  which  are  treated  in  a  separate  section  in  this 
chapter: 

Most  learning  situations  involve  an  interface  of  implicit  and  explicit 
processes.    The  degree  to  which  the  learning  process  and  the  knowledge  base  is 
implicit/explicit  is  a  function  of  the  complexity  of  the  system  is  being  learned 
(regularity  salience)  and  the  cognitive  demand  of  the  task  that  is  employed  to  learn  it 
(task  demands). 
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•  The  implicit  knowledge  base  is  created  by  breaking  up  the  stimulus  to 
analyzable  "chunks",  building  packages  of  knowledge  that  are  associated  in 
various  ways. 

•  Learning  is  gradual,  built  up  from  extensive  experience  with  the  system. 

•  Implicit  knowledge  is  resistant  to  conscious  inspection  but  can  be  used  to  make 
task  related  judgments.  Implicit  knowledge  is  always  ahead  of  the  possessor's 
ability  to  explicate  it 

•  Implicit  and  explicit  systems  are  complementary  and  function  together  in  the 
interest  of  efficient  processing. 

The  Implicit/Explicit  Interface 
Research  indicates  that  implicit  and  explicit  systems  simultaneously  operate 
on  the  types  of  information  that  they  are  respectively  good  at  processing.  The 
extent  to  which  learning  is  implicit  or  explicit  appears  to  depend  on  1)  the  nature  of 
the  system  being  learned  (its  level  of  regularity  salience ),  and  2)  the  type  of  task 
used  to  learn  it  (task  demands  ).  I  have  suggested  that  reading  acquisition,  like 
many  other  complex  activities,  can  be  explored  along  these  two  dimensions.  In 
particular,  learning  to  decode  an  orthography,  while  involving  some  explicit 
analysis,  also  relies  on  a  good  deal  of  implicit  processing.  Specifically,  I  am 
arguing  that  orthographies  appear  to  be  the  types  of  systems  likely  to  employ 
implicit  processes.  Also,  the  types  of  tasks  used  by  children  to  learn  to  read  do  not 
require  them  to  consciously  analyze  all  of  the  associations  present  in  the  system,  but 
only  to  attend  to  certain  salient  aspects  it. 

Orthographies  and  Regularity  Salience 

I  have  suggested  that  orthographies  are  the  types  of  systems  likely  to  engage 
implicit  processes.  Accordingly,  orthographic  systems  can  be  characterized  by 
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moderately  low  "regularity  salience,"  to  follow  Cleeremans'  (1993)  characterization 
of  the  term.  That  is,  many  of  the  orthographic  system's  regularities  are  not 
obvious.  This  is  true  in  various  respects.  First,  in  a  typical  implicit  learning 
environment,  the  number  of  cues  and  features  present  in  the  system  exceeds 
learners'  ability  to  attend  to  them.  Implicit  processes  are  likely  to  be  invoked  when 
the  regularities  are  masked  by  the  complexity  of  the  material  (Reber  1993; 
Cleeremans  1993). 

Correspondingly,  the  cues  for  decoding  an  orthography  are  typically  many, 
and  are  of  many  types.  As  presented  in  detail  in  chapter  three,  writing  systems 
represent  multiple  aspects  of  linguistic  structure  simultaneously.  Across  script 
types,  the  cues  for  recognizing  words  are  typically  are  a  combination  of  some  of  the 
following:  phonemic,  syllabic,  moraic,  morphophonemic,  morphological,  and 
lexical.  So,  for  each  word,  the  reader  is  simultaneously  interpreting  a  number  of 
different  aspects  of  a  linguistic  system. 

Further,  the  cues  for  each  particular  level  of  representation  are  often  of 
many  types,  and  are  not  uniformly  perceptible.  Implicit  learning  research  has 
indicated  that  explicit  analysis  is  likely  to  occur  when  a  system's  cues  are  obvious 
enough  to  find.    In  orthographies,  representation  is  sometimes  overt,  other  times  it 
is  quite  indistinct.  An  overt  cue  may  be  in  the  form  of  a  discrete  unit  set  apart  by 
spaces.  For  instance,  one  letter  can  indicate  one  segment  in  alphabets,  one 
syllabogram  can  indicate  one  syllable,  one  character  can  indicate  one  morpheme, 
and  words  are  often  set  apart  by  spaces.   However,  different  types  of  information 
(lexical,  sub-lexical,  orthographic)  is  often  indicated  by  patterns  of  symbols,  often 
without  boundary  markers  between  constituents,  making  representation  less 
obvious.  For  example,  in  segmental  scripts,  syllabic,  morphophonemic  and 
morphological  information  is  indicated  by  patterns  of  segments.   In  English  for 
instance,  -ed  consistently  represents  the  past  tense,  though  it  is  variably 
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pronounced.  The  written  sequence  -ight  consistently  represents  the  rime,  /ait/,  but 
the  individual  letters  do  not  consistently  correspond  to  particular  phonemes.  Even 
phonemic  information  is  often  indicated  by  patterns  over  units  rather  than  individual 
units.  Segments  are  often  indicated  by  more  than  one  letter,  sometimes  even 
crossing  syllable  or  morpheme  boundaries.  In  English  script  for  instance,  double 
consonants  can  distinguish  the  preceding  vowel,  often  across  syllable  boundaries, 
e.g.,  a-pple.  Similarly,  Modern  Irish  script  uses  vowel  Digraphs  to  signal 
consonant  information.  One  vowel  in  the  vowel  bigraphs  (e.g.,  eaJeaJed)  mark  the 
quality  of  a  preceding  or  following  consonant,  the  other  serves  as  the  main  vowel 
(McManus  1996).  Further,  each  linguistic  constituent  represented  (phoneme, 
word,  rime,  etc.)  will  vary  in  terms  of  frequency. 

The  principle  of  combining  lower  order  symbols  in  systematic  ways  is 
common  to  different  orthographies.  Syllabic  and  logographic  scripts  also  convey 
various  types  of  information  by  symbol  patterns.  Many  syllabic  scripts 
(neosyllabaries,  Korean)  have  additive  structures.  That  is,  by  doubling  or  adding 
strokes  or  diacritics  to  basic  consonant  or  vowel  shapes,  other  consonants,  vowels 
or  tones  are  derived.  In  some  scripts,  spatial  information  like  stroke  directionality 
and  stroke  order  distinguishes  orthographic  symbols.  In  Chinese,  Japanese  Kami, 
and  Korean,  graphs  are  composed  of  a  finite  number  of  possible  strokes,  with 
certain  strokes  occurring  much  more  frequently  than  others.  The  number  of  strokes 
in  a  graph,  the  order  of  writing  them,  or  the  directionality  of  individual  strokes  may 
convey  information.  In  Korean,  for  example,  a  syllable  block  always  begins  with  a 
consonant  sign.  The  vowel  attaches  to  the  side  or  below  following  the  principles  of 
stroke  order.  Everything  "horizontal"  moves  from  left  to  right.  Everything 
"vertical"  moves  top  down.  This  applies  to  the  movement  of  the  individual  strokes, 
and  also  to  how  the  letters  combine.  For  instance,  a  consonant  is  written  first,  then 
the  vowel  is  attached  either  to  the  right  or  below  it  (King  1994). 
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There  is  a  fair  amount  of  orthographic  information  that  is  less  obvious  still. 
For  example,  readers  also  extract  orthographic  regularities  that  are  not  overtly 
encoded  linguistic  units-  statistical  information  about  sequences  that  do  and  do  not 
occur,  and  where  those  sequences  are  possible.  For  instance,  in  English  nt  is  a 
possible  sequence  with  a  particular  frequency,  but  it  occurs  only  syllable/word 
finally,  e.g.,  rent,  or  at  syllable  boundaries,  e.g.,  renting .    tn  occurs  but  only  at 
syllable  boundaries,  e.g.,  partner.   A  sequence  like  dddd  never  occurs.  Further, 
morpheme  and  syllable  boundaries  are  indicated  by  letter  sequence  frequencies 
(Seidenberg  1987).  Also,  because  many  things  are  represented  simultaneously  in 
scripts,  cues  may  be  overlapping  or  may  be  conflicting,  for  example,  pressing  is 
morphological  and  pressing  is  syllabic. 

In  principle,  readers'  ability  to  exploit  statistical  patterns  should  hold  for 
orthographies  in  general.30.  This  seems  fairly  obvious  for  segmental  and  syllabic 
scripts,  but  the  same  is  true  of  logographic  scripts.    For  example,  in  Chinese,  there 
may  be  several  "phonetic"  symbols  for  indicating  the  same  syllable,  some  occurring 
much  more  often  than  others  (Coulmas  1989).  Many  Chinese  words  are 
represented  by  two  (compound)  characters.  Each  character  of  a  compound  may 
represent  a  morpheme  in  its  own  right,  or  may  represent  a  morpheme  only  when  in 
combination  with  another  character,  as  with  "binding  words."  The  characters 
which  compose  binding  words  have  particular  probabilities  of  occurring  in  first  or 
second  position.  Also,  characters  that  form  binding  words  may  occur  in  only  one 
word,  or  more  than  one  word.  Readers  have  been  shown  to  be  sensitive  to  these 
frequencies  (Taft  and  Zhu  1995).  Further,  the  individual  strokes  that  compose 
characters  have  particular  frequencies,  some  occurring  far  more  often  than  others. 

Another  characteristic  of  orthographies  that  indicates  low  regularity  salience 
is  that  cues  and  associations  are  incomplete,  sometimes  conflicting,  and  have 


30  There  have  been  few  empirical  studies  of  this  with  non-alphabetic  scripts. 
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numerous  exceptions.  Likewise,  typical  implicit  learning  situations  present  subjects 
with  noisy  environments  in  which  the  cues  are  incomplete,  only  partially  valid,  and 
in  which  regularities  are  marred  with  exceptions  (Cleeremans  1993).    As  discussed 
in  chapter  three,  all  orthographic  systems  are  incomplete  in  the  sense  that  they  omit 
much  of  the  linguistic  structure  in  the  spoken  system.  While  individual  scripts  vary 
as  to  the  specifics  of  what  they  represent,  it  is  common  across  script  types  that  each 
of  the  levels  of  linguistic  structure  is  only  partly  represented.  For  instance,  very 
regular  phonological  scripts  may  omit  suprasegmental  stress,  tone  and  pitch. 
Morphophonemic  alternations  may  be  represented  at  the  expense  of  consistent 
morphological  representations  (e.g.,  Italian).    Or  a  script  may  represent 
morphemes  regardless  of  their  variable  pronunciation  (e.g.,  English,  Korean). 
Interestingly,  both  English  and  Korean  scripts  represent  morphemes,  but  only  in 
certain  cases.  For  example,  Korean  tends  to  represent  nouns  and  verbs 
consistently,  but  not  necessarily  other  morphemes.  English  represents  many 
base/lexical  morphemes  consistently  (e.g.,  nation/ national,  attractl attraction); 
however,  sometimes  spelling  represents  pronunciation  (e.g., persuade/persuasion). 
All  logographic  scripts  include  varying  degrees  of  phonological  and  semantic 
information.  In  Chinese,  the  amounts  of  semantic  and  phonological  information 
varies  from  character  to  character.  In  some  cases  the  majority  of  distinctive 
information  is  phonological,  in  some  cases  it  is  semantic.  Japanese  represents 
certain  grammatical  elements  with  its  syllabic  script  and  primary  lexical  elements 
with  its  morphological  script,  even  though  everything  in  the  language  can  be 
expressed  by  the  syllabaries.   The  Semitic  scripts  (Arabic,  Hebrew)  represent  only 
consonants.  They  omit  the  vowels  and,  hence,  the  morphemes  those  vowels  stand 
for.  Word  boundary  markers,  such  as  spaces,  may  be  omitted  when  graph 
frequencies  make  those  boundaries  apparent  (e.g.,  Vai)  or  they  may  be  omitted  in 
favor  of  including  phrase  boundaries  (e.g.,  Malayalam  (Mohanan  1996)). 
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Furthermore,  scripts  are  "noisy"  in  the  sense  that  a  single  cue  can  represent 
several  different  things,  or  one  thing  can  be  represented  by  several  different  cues, 
each  with  a  particular  probability  of  occurring  in  various  environments.  Hence, 
many  orthographic  correspondences  vary  by  context  and  there  are  simply  many 
exceptions  to  the  regularities  that  do  exist.   This  is  common  among  scripts 
particularly  because  of  continual  changes  in  the  spoken  language,  intermittent  script 
reform  and  pressures  for  linguistic  preservation.    For  instance,  in  alphabetic 
scripts,  one  sound  may  be  represented  by  several  different  letters,  and  a  single  letter 
may  have  a  variety  of  possible  sounds.  Letter  sequences  also  may  a  have  a  variety 
of  possible  sounds  and  vice  versa.  When  there  are  more  sounds  than  letters  to 
represent  them,  one  symbol  may  come  to  represent  various  sounds  depending  on 
the  environment  in  which  it  occurs.  Consider  the  letter  s  in  English:  s  indicates 
several  different  sounds  and  two  different  morphemes  (plural  and  third  person). 
However,  the  meaning  and  pronunciation  of  s  is  largely  predictable  by  context, 
e.g.,  j  indicates  plural  after  nouns  and  agreement  after  verbs.  Word  final 
pronunciation  fori  is  Is/  after  voiceless  consonants  and  Izl  after  voiced  segments. 
Yet,  word  internal  s  does  not  follow  this  pattern.  It  may  be  pronounced  /s/  as  in 
horse,  chase,  /z/  as  in  please,  wise,  /|/as  in  pleasure,  or  Is/  as  in  passion. 
Likewise,  when  there  are  several  letters  to  represent  a  single  sound,  there  are  also 
contextual  (albeit  incomplete)  cues.  For  instance,  the  letters  *  and  c  both  represent 
the  sound  /k/,  but  k  occurs  more  often  before  the  letter  e  (key),  and  c  more  often 
before  the  letter  a,  (cat) . 

As  a  result  of  such  inconsistencies,  phoneme-to-letter(s)  correspondences 
are  not  simply  the  reverse  of  letter(s)-to-phoneme  correspondences.  For  example, 
in  English  the  phoneme  bl  is  rarely  spelled  au  (probability  =  0. 15),  yet  the  written 
sequence  au  is  almost  always  pronounced  hi  (probability  0.95)  (Berndt,  Reggia 
and  Mitchum  1987  in  Ellis  1994).  The  same  is  true  for  correspondences  between 
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letter  sequences  and  rimes  in  English.  For  example.,  -eak  is  pronounced  /ek/,  as  in 
steak  and  I'M  as  in  beak.    The  sequence  -aid  is  pronounced  fed/  as  in  said,  but 
usually  it  is  pronounced  /ed/  as  in  maitf.   Similarly,  syllabaries  often  have  more 
than  one  graph  to  indicate  a  single  syllable  or  vice  versa.  Due  to  a  great  deal  of 
homophony,  Chinese  script  has  many  "phonetic"  symbols  to  indicate  the  same 
syllable,  each  of  them  occurring  with  certain  classifiers  but  not  others. 

The  last  characteristic  of  low  regularity  salience  is  that  the  amount  of 
external  feedback  about  the  specific  relevance  of  cues  is  minimal 
(Cleeremansl993).   It  is  true  that  amount  of  explicit  instruction  and  feedback  given 
to  beginning  readers  will  vary  by  teaching  approach,  by  teacher,  and  by  parents.    A 
more  traditional,  teacher-directed  code  -  type  classroom  will  use  phonics  drills  and 
applications  which  emphasize  the  systematicity  of  reading.  They  may  also  include 
basal  readers  to  build  sight  vocabulary.   A  more  student-centered,  whole  language 
-type  classroom  has  children  "experience"  print  from  children's  books  and  from 
having  them  produce  their  own  books  and  word  lists.  This  approach  minimizes  the 
explicit  instruction  and  feedback.   Yet  even  a  traditional  classroom  with  rigorous 
phonics  program  draws  attention  to  only  a  fraction  of  the  system's  regularities. 
Phonics  typically  instructs  phoneme-grapheme  correspondences  and  portions  of 
their  contextual  "rules. "    As  pointed  out  previously  however,  phoneme-grapheme 
associations  are  only  part  of  the  total  information  in  any  alphabetic  system,  some 
alphabetic  systems  far  less  than  others.  So  while  some  crucial  features  of  the 
system  are  instructed,  the  total  amount  of  instruction  and  feedback  is  far  less  than 
the  total  amount  of  cues  and  features  the  reader  must  internalize. 

Hence,  instruction  and  feedback  is  far  less  than  what  is  learned  and  readers 
become  sensitive  to  underlying  regularities  (and  irregularities)  which  they  are  not 
explicitly  taught.  In  fact,  teachers  rely  on  the  fact  that  children  will  make  many  of 
the  associations  for  themselves  and  will  use  their  basic  knowledge  about  letter-to- 


109 


sound  correspondences  to  generalize  to  new  instances  that  have  not  been  taught. 
Reading  instruction  generally  presumes  a  certain  amount  of  orthographic 
knowledge  before  formal  teaching  begins.  Also,  phonics  rules  may  not  always 
correctly  capture  phonological  information  in  the  orthography. 

This  is  indeed  a  difficult  issue  to  capture  empirically  since  it  is  impossible 
to  control  for  children's  preexisting  knowledge  or  home  instruction.  However, 
Treiman's  (1993)  longitudinal  study  of  first  graders'  spelling  (whose  reading 
program  included  phonics)  suggests  that  children  do  learn  many  orthographic 
relationships  that  are  not  formally  taught.  Their  spelling  errors  suggest  that  first 
graders'  already  know  a  lot  about  orthographic  regularities-where  letter  sequences 
can  occur  and  where  they  can't.  For  example,  they  know  sequences  like  ck  don't 
occur  at  the  beginning  of  a  word.   And  although  at  this  age  they  are  primarily 
"phonemic  spellers,"  as  opposed  to  formal  spellers,  they  are  beginning  to  induce 
some  formal  properties  of  the  orthography,  many  of  which  are  not  instructed.  An 
interesting  example  of  this  is  illustrated  by  children's  spellings  of  the  phoneme  111. 
The  most  common  spelling  of  111  is  s,  both  in  words  that  first  graders  know,  and 
in  general.  However,  this  correspondence  was  not  pointed  out  to  the  children. 
Instead,  their  phonics  book  and  spelling  lists  stressed  the  correspondence  between  z 
and  It].    When  asked  how  to  spell  It],  they  were  told  z  by  the  teacher  and  (most 
likely)  parents.  Additionally,  their  knowledge  of  letter  names  should  cause  them  to 
spell  It]  as  z.    However,  most  children  spelled  111  as  s,  in  both  correctly  spelled  and 
misspelled  words3'.  This  may  be  both  because  children  become  sensitive  to  the 
typical  spelling  of  It]  and  also  because  they  implicitly  understand  the  plural  is 
represented  by  -s. 


An  interesting  question  that  this  analysis  did  not  tease  out  is:  are  children  equally  likely  to 
spell  Izl  as  s  in  all  contexts,  or  are  they  more  likely  to  spell  Izl  as  s  in  certain  contexts  - 
plurals,  word  finally,  word  internally,  etc.  That  is,  young  children  may  also  already  be  sensitive  to 
where  Izl  occurs  as  s,  as  well  as  how  often  it  does. 
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Task  Demands 

The  cognitive  demands  of  the  particular  tasks  used  to  learn  a  system  will 
also  influence  how  it  is  learned.  The  greater  the  demand  of  the  task,  the  more  likely 
learner  will  attempt  to  use  explicit  code-breaking  strategies  (Cleeremans  1993).  The 
fundamental  tasks  associated  with  beginning  reading  acquisition  are  recognizing, 
pronouncing,  and  spelling32  written  words.  To  this  end,  beginning  readers  are 
typically  given  the  following  associated  subtasks:  learning  the  basic  elements  of  the 
system,  (e.g.,  letters  of  the  alphabet),  and  to  varying  levels  of  explicitness,  learning 
the  basic  associations  between  linguistic  units  and  the  written  symbols,  (e.g., 
letters-to-phonemes).  They  may  also  be  required  to  "blend"  symbol  sounds  into 
words.  Beginning  readers  are  also  usually  required  to  memorize  meaning-based 
chunks  like  whole  words  (e.g.,  "sight  words"  in  English),  or  whole  characters 
(e.g.,  Chinese  characters,  Kanji). 

As  pointed  out  in  the  previous  chapter,  looking  for  structure  in  the  learning 
material  is  a  high  demand  task.  Actively  seeking  phonemic  correspondences  and 
blending  phonemes  is  high  demand,  and  as  a  result,  readers  do  attempt  explicit 
analysis  of  the  code  (e.g.,  "sounding  out"  words  in  order  to  read  or  spell  them). 
However,  such  tasks  do  not  require  children  to  consciously  analyze  aU  of  the 
associations  present  in  a  script.  Instead,  they  are  tasks  that  require  them  to  attend  to 
certain  obvious  aspects  of  the  system,  such  as  symbols  representing  sounds,  and 
groups  of  symbols  between  spaces  indicating  words.  Where  representation  is 
straightforward,  and  they  have  developed  some  awareness  of  linguistic  structures 


32   Arguably,  spelling  is  a  more  explicit  task  than  word  recognition,  and  performing  this  task 
should  result  in  a  more  explicit  knowledge  of  the  system.    In  fact,  in  a  series  of  studies,  Bradley 
and  Bryant  ( 1979, 1985)  found  that  early  readers'  reading  and  spelling  skills  were  not  equivalent. 
There  were  many  words  that  individual  children  could  read  but  not  spell  and  some  words  they 
could  spell  but  not  read.  The  authors  suggest  that  the  cues  learners  use  to  read  are  different  for 
reading  and  spelling.  The  task  of  spelling  appeared  to  result  in  exploiting  phonological 
connections,  while  word  recognition  relied  on  other  factors,  (unspecified).  I  believe  that  implicit 
and  explicit  learning  strategies  bear  on  this  issue.  This  difference  may  reflect  the  fact  that  spelling 
(actually  producing  the  letters  of  a  word)  is  a  more  explicit  task  than  word  recognition  and  hence 
more  likely  to  invoke  explicit  phonemic  recoding. 


Ill 


(e.g.,  phonemic  awareness),  young  children  can  successfully  perform  a  task  like 
letter-to-phoneme  translation.  Yet,  considering  the  quantity  and  diversity  of  (often 
conflicting)  cues  typically  present  in  scripts,  attempting  to  explicitly  analyze  all  of 
them  simultaneously  would  surely  be  unsuccessful,  particularly  for  young  children. 
And  although  children  are  faced  with  the  demanding  task  of  mapping  letters-to- 
phonemes,  they  are  also  usually  performing  other  lower  demand  tasks,  such  as 
looking  at  words  and  stories  and  committing  whole  words  to  memory.  In  doing 
so,  they  are  able  to  induce  many  of  the  systematic  properties  of  words  that  are  not 
pointed  out  to  them. 

To  summarize  regularity  salience  and  task  demands:   Writing  systems  have 
many  different  kinds  of  cues,  some  of  which  are  fairly  obvious,  many  of  which  are 
not   Most  notably,  writing  systems  have  the  characteristic  of  representing  multiple 
aspects  of  the  reader's  linguistic  knowledge  at  once,  many  of  the  associations  for 
which  are  not  straightforward.  Readers  also  internalize  orthographic  regularities 
that  are  not  overtly  encoded  linguistic  units.    Furthermore,  as  a  representation  of 
spoken  language,  writing  systems  are  incomplete,  indicating  particular  structures 
only  partially.  Because  spoken  languages  tend  to  change  more  rapidly  then  their 
written  form,  orthographies  tend  to  have  many  redundancies  and  partial  indicators, 
as  well  as  exceptions  that  don't  adhere  to  general  patterns.  Hence,  orthographies 
have  many  of  the  characteristics  that  indicate  low  regularity  salience  and  so  are  the 
type  of  systems  that  are  likely  to  engage  implicit  learning  processes. 

Yet,  while  scripts  have  many  characteristics  of  low  regularity  salience,  it  is 
not  the  case  that  orthographic  properties  should  be  wholly  unavailable  to 
consciousness.    In  fact,  some  orthographic  structures  are  quite  overt  and  can  be 
consciously  analyzed  (even  by  5-year-olds).  Moreover,  learners  are  typically 
engaged  in  the  explicit  task  of  making  (some  salient)  orthographic-linguistic 
correspondences.  And  becoming  consciously  aware  of  how  some  obvious 
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properties  of  an  orthography  corresponds  to  one's  own  linguistic  knowledge 
appears  to  be  an  important  component  of  learning  to  read.    Consequently,  as 
predicted  by  implicit  learning  research,  implicitand  explicit  mechanisms 
simultaneously  operate  on  the  types  of  information  that  they  are  respectively  good  at 
processing. 

Chunking 
Implicit  knowledge  is  fragmented,  consisting  of  many  local  "microrules" 
that  specify  relationships  between  elements.   There  is  evidence  that  within  implicit 
learning  phenomena,  a  knowledge  base  is  created  by  chunking,  or  the  natural 
tendency  to  break  sequences  of  material  into  smaller,  analyzable  bits.    By  virtue  of 
having  been  created  from  structured  material,  chunking  implicitly  encodes  the 
system's  constraints.  If  no  organization  exists  a  priori  in  the  stimuli,  subjects  will 
impose  their  own.   Learners  select  patterns  of  common  features  shared  by  many 
exemplars  and  build  a  body  of  knowledge  by  encoding  small  subsequences  of 
strings  (Reber  and  Lewis  1977;  Holland  et  al.  1986;  Servan-Schriber  and  Anderson 
1990;  Mathews  et  al.  1989;  Cleeremans  1993;  Reber  1993).  For  instance  ,  given  a 
string  TTXVPXVS  to  be  committed  to  memory,  subjects  may  first  create  the 
chunks  TTX  VP  XVS.  Following  strings  are  likely  to  be  perceived  in  terms  of 
previously  analyzed  chunks.  Hence,  new  strings  seem  less  complex  and  constrain 
further  chunking  (Servan-Schreiberand  Anderson  1990). 

There  is  considerable  psycholingusitic  evidence  (presented  in  chapter  four) 
that  readers  chunk  orthographic  information.  In  particular,  readers  break  up  words 
into  analyzable  fragments.  While  artificial  grammar  learning  involves  learning 
structure  of  meaningless  material,  with  reading,  the  stimulus  is  organized  into 
linguistic  units,  and  in  this  sense  the  chunks  are  predetermined.  Since 
orthographies  represent  various  aspects  of  the  reader's  grammar,  the  chunks  should 
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correspond  (more  or  less)  to  that  structure.  Empirical  evidence  indicates  that  this  is 
indeed  the  case.    For  instance,  readers  of  alphabetic  scripts  tend  to  chunk  words 
into  onset  and  rime  (Treiman  1992,  1993;  Goswami  and  Bryant  1992;  Goswami 
1986, 1988;  Glushko  1979,  1981)  and  into  their  various  morphemic  components 
(Chialant  and  Caramazza  1995;  Stoltz  and  Feldman  1995;  Laudanna  et  al.  1989; 
Feldman  1987).  Interestingly,  readers  analyze  morphemic  units  even  when  the 
letters  in  the  sequence  are  not  immediately  proximate.  This  is  true  of  Hebrew,  a 
language  where  inflectional  and  derivational  morphemes  are  nonconcatenative  but 
"break  up"  the  root  (Benton  and  Frost  1995).  It  seems  then,  that  readers  tend  to 
chunk  written  language  in  a  manner  that  is  consistent  with  linguistic  organization. 

Readers  also  induce  orthographic  regularities  by  extracting  small 
subsequences  of  material  and  their  contextual  frequencies.   For  example,  certain 
sequences  may  only  appear  syllable  finally  but  not  syllable  initially  or  vice  versa. 
nt  occurs  only  syllable  finally  and  across  syllables  while  tr  occurs  syllable  initially 
and  across  syllables.  Readers  have  been  shown  to  extract  morpheme  and  syllable 
boundaries  via  letter  chunks  within  words  (Seidenberg  1987).  For  instance,  the 
bigram  nv  as  in  anvil  occurs  only  at  syllable  boundaries.     Hence,  readers  extract 
overall  letter  sequence  frequencies  by  chunking  smaller  sequences  and  associating 
these  chunks  in  various  ways 

Learning  is  Gradual 

With  implicitly  acquired  knowledge,  learning  is  gradual,  built  up  from 

extensive  experience  with  the  system  (Berry  and  Broadbent  1988;  Reber  1993; 

Cleeremansl993)  Likewise,  orthographic  knowledge  is  built  up  gradually  from 

reading  experience*.  In  order  to  decode  efficiently,  readers  must  learn  extensive 

associated  linguistic  and  orthographic  patterns.    Regardless  of  how  beginning 

33  It  is  important  lo  point  out  here  that  attention  to  input,  not  simply  exposure,  is  a  necessary 
condition  for  learning  of  all  types.  No  type  of  learning  is  based  on  unattended  stimuli. 
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reading  experiences  are  organized  and  regardless  of  teaching  method,  this 

knowledge  base  is  assembled  progressively  with  reading  experience.  Perhaps  the 

one  uncontroversial  point  among  reading  researchers  and  educators  is  that  reading 

ability  is  dependent  on  reading  exposure.   Particularly  decoding,  unlike 

comprehension,  relies  on  a  knowledge  base  that  is  difficult  to  acquire  outside  of 

reading.  This  is  true  both  in  regard  to  learning  purely  orthographic  regularities- 

which  letter  sequences  can  occur  and  where--  and  also,  learning  how  the 

orthography  encodes  linguistic  structures. 

Treiman  (1993)  constructed  a  "orthographic  constraints  test"  which  tested 

knowledge  of  a  number  of  specific  orthographic  regularities  (e.g.,  "which  one 

looks  like  a  word  ffeb  orbeffV")  These  regularities  are  patterns  present  in 

orthography  that  are  not  taught.  A  first  study  indicated  that  among  kindergartners 

(56.4%  correct),  first  graders  (62.3%  correct),  second  graders  (83.2%  correct)  and 

college  students  (94.5%  correct),  orthographic  knowledge  increased  with  age 

and/or  print  exposure-    A  second  study  compared  the  results  of  the  orthographic 

constraints  test  with  the  same  subject's  score  on  a  reading  test.  Treiman  found  that 

the  better  readers  in  all  three  children's  groups  knew  more  about  orthographic 

patterns,  suggesting  that  age  is  not  the  indicator  of  amount  of  orthographic 

knowledge.  A  third  study  compared  two  other  groups  of  first  grade  students.  One 

group  was  an  urban,  working-class,  racially  mixed  group.  The  other  was  a  group 

of  suburban,  middle  and  upper  middle-class  children.  Presumably,  there  is  a 

significant  difference  in  pre-school  reading  exposure  between  these  groups.  As 

expected,  the  suburban  group  performed  significantly  better  on  the  orthographic 

constraints  test.  Combining  the  results  of  these  three  studies,  Treiman  concluded 

that  children  learn  many  properties  of  English  orthography  without  instruction  and 

that  this  knowledge  increases  with  print  exposure. 

^Geographically  regular  items  and  orthographic  violations  were  equally  easy  to  pronounce  in 
order  to  control  for  the  influence  of  subjects' phonological  knowledge. 
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Cunningham  and  Stanovich  (1990,  1991)  make  a  related  point.  They 
measured  third,  fourth,  fifth  and  sixth  grade  children's  reading  exposure  by  using  a 
Title  Recognition  Test.   Their  studies  showed  that  the  variation  in  print  exposure 
among  children  correlated  with  significant  variance  in  orthographic  knowledge- 
spelling  and  word  knowledge-even  when  1Q,  memory  ability,  and  phonological 
awareness  was  controlled.  These  studies  do  not  tease  out  the  different  types  of 
orthographic  knowledge:  linguistic  correspondences  and  orthographic  regularities. 
However,  they  do  indicate  that  reading  experience  correlates  with  general 
orthographic  knowledge-word  recognition  and  spelling-  of  third  to  sixth  grade 
children.    Again,  this  shows  that  knowledge  of  an  orthographic  system  increases 
over  time,  with  practice. 

Readers'  knowledge  of  the  many  linguistic  correspondences  in  an 
orthography  also  increases  gradually.  For  example,  how  derivational  and 
inflectional  morphology  is  represented  in  English  orthography  is  typically  not 
instructed  in  reading  or  spelling.   Eventually,  however,  as  their  reading  experience 
grows,  children  do  learn  morphological  associations,  regardless  of  minimal  or 
conflicting  instruction.  Treiman's  (1993)  study  showed  that  first  grade  children 
generally  do  not  yet  understand  the  morphological  basis  of  English  spelling.    For 
example,  the  children  in  her  study  tended  to  spell  the  past  tense  phonemically  (e.g., 
t,  d  ,  id,  ed ),  instead  of  consistently  spelling  it  -ed.  Otherwise,  they  tend  to  omit 
spelling  the  past  tense  all  together.  This  is  likely  the  result  of  several  of  factors. 
First,  instruction  focuses  on  the  phonemic  rather  than  morphological  aspects  of  the 
system,  so  their  spelling  reflects  their  attempts  to  use  their  explicit  knowledge. 
Second,  first  graders  do  not  see  many  morphologically  complex  words  (perhaps 
with  exception  of  the  plural)  in  their  readers  or  spelling  lists,  and  their  phonics  book 
presented  the  past  tense  alternations  very  minimally.  Interestingly,  the  fact  that  first 
graders  are  likely  to  omit  the  past  marker  entirely  in  their  spelling  suggests  that  they 
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are  at  least  beginning  to  recognize  it  as  a  distinct  unit.  Furthermore,  correct  spelling 
of  past  tense  verbs  were  rare  during  the  first  semester  but  were  more  common  in  the 
second  semester.  The  one  individual  child  whose  past  tense  progress  was  closely 
tracked  throughout  the  school  year  began  to  spell  regular  past  tense  forms  with  -ed 
by  the  end  of  the  first  grade  year.  Treiman  concluded  children  need  at  least  half  a 
year's  reading  experience  before  they  begin  to  grasp  the  representation  of  the 
regular  past  tense.  Children  clearly  need  more  experience  with  other 
morphologically  complex  forms  before  they  begin  to  induce  their  morphological 
spellings. 

On  Consciousness 
Implicit  learning  research  indicates  that  implicitly  acquired  knowledge  is 
resistant  to  conscious  inspection  and  analysis  but  it  can  be  used  to  make  subsequent 
task  related  judgments.   That  is,  implicit  knowledge  of  a  complex  system  is  general 
enough  to  be  used  with  new  instances,  but  is  also  difficult  to  express  when  asked  to 
articulate  that  knowledge.  This  is  not  to  say  that  implicit  knowledge  is  totally 
unconscious,  but  it  is  always  more  sophisticated  than  a  person's  ability  to  explicate 
it  (Reber  1993). 

Written  language  is  a  representation  of  spoken  language,  the  knowledge 
base  for  which  is  largely  implicit  As  this  discussion  has  indicated,  the  process  of 
learning  to  read  (both  prereading  and  reading  activities)  brings  a  certain  amount  of 
the  learners  implicit  knowledge  to  a  conscious  level,  e.g.,  phonological  awareness. 
Indeed,  the  reading  literature  strongly  suggests  that  a  certain  amount  of  linguistic 
awareness  correlates  with  reading  success,  whether  it  precedes  reading,  or  is  a 
result  of  reading,  or  both.  Hence  it  appears  that  reading  is  an  activity  that  both 
facilitates,  and  to  some  degree  requires,  conscious  inspection  of  one's  implicit 
knowledge. 
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Yet,  conscious  awareness  of  one's  linguistic  system  and  writing  system 
lags  behind  one's  implicit  knowledge  of  those  systems.  That  is,  explicit  knowledge 
of  the  spoken  and  written  language  systems  is  less  comprehensive  than  one's 
implicit  knowledge,  and  it  is  less  easily  articulated.  For  example,  young  children 
may  implicitly  learn  the  (English)  linguistic/orthographic  associations  for  the  plural 
and  past  tense  and  both  be  able  to  read  and  spell  them  correctly.  This  ability  is  sure 
to  differ  from  their  ability  to  explain  the  morphological  associations  or  how  they 
arrive  at  the  variable  pronunciations. 

Scholes  (1990,  1991)  and  Scholes  and  Willis  (1990)  have  done  a  number  of 
experimental  studies  which  indicate  that  readers'  ability  to  demonstrate  their 
knowledge  of  phonological,  morphological  and  lexical  associations  in  English 
orthography  is  less  than  their  implicit  knowledge  of  both  spoken  and  written 
language. 35  For  example,  the  results  of  phoneme  deletion  tasks  with  highly  literate 
speakers  show  that  readers  do  not  always  explicitly  understand  the  relationships 
between  sounds  and  letters.  Over  half  of  native  speaker,  college  students  who  were 
asked  to  delete  phonemes  that  were  not  uniquely  represented  by  a  single  letter  failed 
to  do  so.  (e.g.,  /takst/  taxed  take  away  the  /s/  =  /takt/  tactltacked  (Scholes  and 
Willis  1991).  This  does  not  mean  these  same  subjects  do  not  successfully  identify 
x  as  /ks/  when  reading  or  writing  text.  However,  it  is  clearly  difficult  to  bring  this 
knowledge  to  the  level  of  conscious  awareness,  even  when  explicitly  directed  to  do 


so. 


Implicit/Explict  Learning  is  Efficient 
Implicit  learning  research  suggests  that  implicit  and  explicit  learning  systems 
are  "synergistic"  (Mathews  etal.  1989).  Consequently,  the  total  learning 


*    T      Zlvg  a        ,rcnt  conduslon  from  *•">  experiments  than  the  authors  do.  They  conclude 
that  the  inability  to  explicitly  demonstrate  knowledge  of  phonemes  or  morphemes  indicates  thai 
they  are  not  psychologically  real  linguistic  structures.  I  am  arguing,  instead,  that  these  studies 
indicate  that  implicit  knowledge  oflinguistic  structures  is  difficult  to  articulate. 
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experience  is  both  more  effective,  and  more  nuanced,  than  it  would  be  if  these 
systems  operated  independently  of  each  other.  Reading  is  the  type  of  complex 
activity  that  benefits  from  concurrent  implicit  and  explicit  analysis. 

When  children  learn  to  read,  they  bring  together  their  tacit  linguistic 
knowledge  with  some  awareness  of  that  knowledge.  They  also  bring  together  a 
mixture  of  implicit  and  explicit  learning  of  orthographic  properties.  Moreover,  it  is 
the  complex  joining  of  tacit/conscious  knowledge  and  implicit/explicit  learning  of 
orthographies  that  defines  the  process  of  reading  acquisition.  The  complexity  of 
this  mixture  has  given  rise  to  questions  about  what  blend  is  optimal  for  successful 
reading:    How  developed  should  a  child's  verbal  language  abilities  be?  How 
aware  should  a  child  be  of  their  tacit  linguistic  knowledge  (e.g.,  phonological 
awareness)  to  read  successfully?  How  aware  will  the  skill  of  reading  make  a  child 
about  their  implicit  linguistic  knowledge?  How  much  preschool  print  exposure  is 
enough  for  reading  readiness?  How  much  reading  practice  is  necessary  to  be  a 
good  reader?  How  much  explicit  teaching  of  linguistic-graphic  relationships  is  the 
proper  amount?  The  issues  surrounding  all  of  these  questions  have  been  shown  to 
affect  reading  acquisition.  It  is  precisely  because  reading  is  a  product  of  the 
interaction  between  implicit/explicit  learning  and  knowledge  that  these  are  pertinent 
questions. 

I  have  suggested  in  the  previous  chapter  that  it  is  particularly  where  the 
implicit  and  explicit  merge  that  such  practical  and  theoretical  questions  arise. 
Implicit  learning  research  can  inform  reading  on  these  very  issues.  I  do  not  claim 
for  this  discussion  to  give  exact  answers  to  the  questions  like:  how  phonological ly 
aware  do  children  need  to  be  to  learn  to  read  ?  Or  how  much  explicit  instruction  is 
the  optimal  amount?   What  is  clear  from  this  discussion,  however,  is  the  answers 
to  the  above  questions  are  not  simply  "the  more  the  better."  Instead,  the  issues 
underlying  such  questions  will  undoubtedly  profit  from  a  clearer  understanding  of 
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implicit  and  explicit  mechanisms  and  the  recognition  that  they  are  complementary 
subsystems  that  operate  in  the  interest  of  processing  efficiency.  These  subsystems 
operate  in  parallel  on  environmental  features  that  they  are  respectively  good  at 
processing.  In  fact,  viewing  reading  acquisition  as  an  interface  of  implicit/explicit 
learning,  which  is  prompted  by  the  motivation  for  efficient  processing,  provides  a 
framework  through  which  these  reading  issues  can  be  more  effectively  addressed. 

Implicit  learning  research  has  some  specific  insights  which  bear  on  the 
debated  reading  issues  mentioned  above.  The  fact  is  that  some  types  of  information 
are  readily  processed  implicitly;  others  are  easily  and  effectively  with  explicit 
mechanisms.  Typically,  the  more  obvious  the  structure,  the  more  effective  explicit 
learning  is.  The  less  obvious  the  structure,  the  more  useful  implicit  analysis 
becomes.  I  have  argued  that  orthographies  have  some  obvious  but  also  many  non- 
obvious  properties.  Because  of  the  proclivity  for  processing  efficiency,  learners 
will  have  a  natural  tendency  to  engage  implicit  and  explicit  mechanisms  where  they 
are  most  effectively  used.  Consequently,  readers  will  learn  certain  orthographic 
properties  explicitly,  certain  others  implicitly.  This  will  have  specific  implications 
which  bear  on  many  of  the  unresolved  questions  of  reading  acquisition,  such  as, 
how  much  explicit  instruction  is  optimal?   I  will  address  these  points  in  the  next 
chapter  which  considers  the  theoretical  and  practical  implications  of  this  discussion. 

A  related  issue  is  the  relationship  of  decoding  and  comprehension.  The 
larger  discussion  has  focused  on  reading  acquisition  in  regard  to  learning  to  decode. 
However,  the  interdependence  of  decoding  and  comprehension  bears  directly  on  the 
argument  for  processing  efficiency.  It  is  highly  unlikely  that  young  children  could 
consciously  explore  all  of  the  many  linguistic  patterns  presented  to  them  by  an 
orthographic  system,  or  invoke  explicit  hypothesis  testing  strategies.  Even 
assuming  that  young  children  were  capable  of  this,  such  a  strategy  would  allocate 
tremendous  cognitive  resources  to  a  task  readily  handled  by  less  labor  intensive 
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implicit  processes.  In  the  interest  of  efficiency ,  it  is  more  adaptive  to  employ 
implicit  systems  for  processes  such  as  decoding,  thereby  reserving  explicit 
mechanisms  for  other  aspects  of  reading  which  are  likely  to  require  more  explicit, 
intentional  processing.  These  would  be  processes  involved  with  comprehension 
and  interpretation,  such  as  explicit  memory  and  attention,  making  semantic  and 
pragmatic  inferences,  and  reasoning.    In  fact,  reading  research  shows  that  one 
contributor  to  underdeveloped  reading  comprehension  skills  is  often  lack  of 
decoding  fluency.  The  evidence  indicates  that  better  readers  invest  far  less  explicit 
attention  to  decoding,  freeing  attention  and  memory  so  that  the  conceptual 
comprehension  of  the  message  is  very  high36  (Perfetti  1985;  McLeod  and 
McLaughlin  1986).  Here  again,  it  is  the  processing  system's  selective  partialling 
out  of  implicit  and  explicit  analysis  that  results  in  efficient  reading  skills. 

Toward  a  Theory  of  Reading  Acquisition: 
A  Proposed  Framework 

Motivated  by  the  effort  towards  efficient  processing,  reading  acquisition, 

like  other  complex  learning  situations,  involves  a  mixture  of  implicit  and  explicit 

learning  processes.  Employing  implicit  and  explicit  processes  to  a  learning  task  is  a 

matter  of  allocating  resources  where,  and  in  a  manner  that  they  can  be  most 

effectively  used.  Consequently,  acquired  knowledge  from  a  complex  learning 

environment  is  a  synthesis  of  implicit  and  explicit  analysis.  The  degree  to  which 

the  learning  process  and  the  knowledge  base  are  implicit/explicit  will  vary  as  a 

function  of  what  is  being  learned  and  what  task  is  employed  to  learn  it.  A  learning 

36  This  phenomenon  is  often  interpreted  to  be  the  result  of  decoding  becoming  an  automatic  as 
opposed  to  a  controlled  process  (Anderson  1983;  McLaughlin  1990).  The  controlled-automatic 
distinction  assumes  there  are  limitations  on  the  amount  of  information  that  can  be  attended  to  in 
working  memory.   When  decoding  becomes  automatic,  it  facilitates  subject  controlled  attention 
demanding  reasoning  type  processes  that  are  involved  in  understanding  tent  (Schneider  and  Shiffrin 
1977;  Baron  1977;  Singer  1984;  Liberman  et  al  1977).    Yet,  how  a  process  becomes  automatic 
may  vary.  A  process  may  become  automatic  through  practice  with  conscious,  intentional  learning 
strategies.  Implicit  processes  also  become  more  automated  with  practice  Thus  implicit-explicit 
is  a  separate  dimension  from  controlled-automatic  (Ellis  1994). 
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situation  that  has  a  very  wide  or  diverse  number  of  cues,  where  cues  are  partial  and 
sometimes  conflicting,  and  where  there  is  little  external  feedback  is  likely  to  engage 
implicit  processes.  Conversely,  systems  where  cues  are  complete,  valid  and 
obvious  are  likely  to  engage  explicit  learning.  Also,  where  the  task  used  to  learn  a 
system  is  cognitively  demanding,  learners  will  attempt  to  use  explicit  code-breaking 
strategies.  Where  task  demand  is  low,  implicit  learning  is  likely. 

Following  this  characterization  of  the  implicit/explicit  learning  interface, 
orthographies  appear  to  be  the  types  of  systems  that  will  invoke  some  explicit 
learning,  but  also  a  large  degree  of  implicit  induction.  Commonalties  among 
writing  systems  indicate  that  orthographies  are  complex  systems  representing 
multiple  levels  of  linguistic  structure.  Hence,  deciphering  them  will  involve  several 
different  kinds  of  analysis  occurring  simultaneously.  In  addition  to  being  of  many 
types,  associations  are  complex,  incomplete  and  sometimes  conflicting.  Moreover, 
there  is  evidence  that  readers  do  extract  these  many  types  of  information 
simultaneously.   Yet,  orthographies  are  not  completely  inaccessible  to  conscious 
analysis  and  reading  does  rely  on  conscious  interpretation  of  some  of  its  salient 
properties.  Furthermore,  the  tasks  involved  in  learning  to  read  promote  some 
explicit  and  some  implicit  learning.  Learning  to  read  then,  involves  an  interface  of 
implicit  and  explicit  processes,  each  being  employed  where  they  are  most 
effectively  used. 


CHAPTER 7 
PRACTICALAND  THEORETICAL  IMPLICATIONS 


Implications  for  Instruction 
I  have  stated  that  traditional  approaches  in  education  do  not  usually  consider 
the  role  implicit  learning  or  implicit  knowledge  plays  in  regard  to  formal  instruction. 
Traditional  teaching  methods  generally  assume  there  is  an  instruction  phase  where 
information  is  presented  in  more-or-less  factual  form,  followed  by  a  practice  phase 
where  that  knowledge  is  put  into  use.  Yet  there  are  types  of  learning  for  which  this 
basic  method  is  not  always  successful.  Were  reading  acquisition  the  type  of 
learning  that  fit  this  conventional  formula,  it  would  hardly  be  a  topic  of  continuing 
debate.  I  believe  that  the  framework  for  learning  to  read  that  I  have  suggested  has 
practical  implications  for  some  of  the  contentious  issues  of  reading  acquisition. 

When  is  Instruction  Useful? 

A  major  implication  of  the  implicit/explicit  learning  interface  is  that 
instruction  can  effectively  target  some  things  but  not  others.  Research  has  indicated 
that  in  general,  the  more  obvious  or  salient  the  underlying  structure,  the  more  useful 
explicit  learning  and  explicit  training  becomes.  However,  training  does  not 
necessarily  lead  to  improved  performance  where  systems  have  relatively  complex, 
non-obvious  underlying  structures.  Because  scripts  have  both  obvious  and  non- 
obvious  properties,  there  is  a  limit  to  the  amount  of  instruction  that  will  benefit 
reading  acquisition,  as  well  as  a  specificity  to  the  type  of  structures  that  should  be 
taught. 
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Which  properties  of  a  script  are  most  salient  will,  of  course,  vary  by  script 
type.  Segmental  scripts  are  most  obviously  composed  of  phonemes;  logographic 
scripts  are  most  obviously  composed  of  morphemes.  Many  scripts  of  different 
types  have  word  boundary  markers  of  some  kind.  Certainly,  some  explicit 
knowledge  of  these  basic  correspondences  are  necessary  for  reading.  But  also, 
some  explicit  knowledge  of  other  consistent  features  of  the  system  should  be 
beneficial.  In  English  for  instance,  many  of  the  words  that  readers  initially  learn  are 
monosyllabic.  These  words  are  more  consistently  represented  by  onsets  and  rimes, 
than  a  blending  of  individual  phonemes  (as  discussed  in  chapters  three  and  four). 
For  example,  the  onsets  and  rimes  of  words  like:  blare,  blank,  blast,  blah,  blame 
are  more  consistently  represented  than  are  the  individual  letters,  particularly  the 
vowels.  Furthermore,  children  and  adults  exploit  these  consistencies,  though  they 
may  not  be  explicitly  aware  of  it.  It  would  seem  preferable  to  point  out  this 
consistent  property  of  English  script  than  to  instruct  the  many,  and  much  less 
obvious,  vowel  rules  which  could  explain  these  pronunciations.  Conversely,  the 
pronunciation  of  English  multi-syllabic  words  has  many  contingencies: 
morphological  representation,  lexical  stress,  onset  and  rime  representation  and 
several  types  of  segmental  cues.  The  synthesis  of  these  diverse  orthographic  cues 
is  something  that  would  be  difficult  for  young  children  to  consciously  accomplish, 
and  so  learning  is  more  likely  to  efficiently  proceed  implicitly. 

Another  relevant  point  that  emerges  is  that  explicit  instructions  are  disruptive 
to  learning  when  not  in  accord  with  the  system  that  the  learner  is  implicitly 
inducing.  This  principle  is  applicable  to  reading  in  that  the  common  belief  that  an 
alphabetic  system  represents  a  single-leveled,  concatenation  of  phonemes  can 
influence  teachers  to  give  misinformation.  For  example,  students  may  be 
encouraged  to  look  for  phonemic  correspondences  where  representation  is 
morphological  or  syllabic.  There  is  substantial  evidence  from  children's  spelling 
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that  this  occurs.  While  children  eventually  induce  the  proper  correspondences,  it 
would  be  valuable  for  teachers  to  be  aware  that  the  system  is  not  simply 
phonological,  and  that  learning  to  decode  an  orthography  involves  interpreting 
different  levels  of  linguistic  structure. 

How  Much  Instruction  is  Optimal? 

An  issue  related  to  which  types  of  information  should  be  taught,  is  how 
much  instruction  is  necessary  and  how  comprehensive  need  it  be.  Reber  ( 1989, 
1993)  suggests  that  an  effective  use  of  explicit  instruction  for  learning  complex 
systems  is  to  direct  and  focus  the  learners'  attention  on  relevant  systematic 
properties,  rather  than  teaching  the  system  in  any  complete  way.  Learners,  in 
effect,  teach  themselves.  Similarly,  input-oriented  approaches  to  second  language 
acquisition  are  based  on  teaching  as  a  "cognitive  focusing  device"  to  facilitate 
implicit  learning.  In  this  view,  the  role  of  instruction  is  to  increase  the  salience  of 
target  language  forms  so  that  they  are  likely  to  be  noticed  by  learners  and  further 
learning  can  proceed  implicitly  (  Schmidt  1994).  Hence,  more  instruction  is  not 
always  better  for  teaching  complexly  structured  information.  Yet,  this  does  not 
negate  the  idea  that  explicit  focus  on  certain  salient  aspects  of  the  system  is 
beneficial  to  learning. 

How  Aware  must  Learners  be  of  Relevant  Preexisting  Knowledge? 

Another  pertinent  question  is  how  to  effectively  build  upon  implicit 
knowledge  children  bring  to  formal  instruction.  Instruction  may  have  a  role  to  play 
by  directing  the  learners'  focus  to  preexisting  tacit  knowledge  that  bears  on  the 
learning  task.  Since  writing  systems  are  a  representation  of  spoken  language, 
readers  must  (implicitly  or  explicitly)  apply  their  preexisting  linguistic  knowledge  to 
learning  to  read.  This  point  bears  directly  on  the  positive  connection  between 
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phonological  awareness  and  reading.  Many  studies  have  indicated  that  some  degree 
of  phonological  awareness  correlates  with  reading  success,  at  least  with  regard  to 
alphabetic  scripts.  It  has  also  been  noted  that  explicit  training  is  not  always 
necessary  for  beginning  readers'  awareness  of  linguistic  structures  to  progress. 
Some  learners  have  a  degree  of  linguistic  awareness  before  reading  begins;  others 
develop  it  as  they  leam  to  read.  Yet,  awareness  instruction  can  potentially  aid 
learners'  application  of  knowledge  they  already  bring  to  the  reading  task(s).    Some 
awareness  training  may  aid  learners  by  making  them  aware  of  connections  between 
knowledge  they  already  possess,   and  relevant  features  in  the  system,  and 
consequently,  guide  subsequent  learning.     Implicit  learning  research  suggests,  in 
fact,  that  training  linguistic  awareness,  which  calls  the  learners'  attention  to  relevant 
systematic  features,  would  be  more  likely  to  facilitate  further  learning  than 
attempting  to  instruct  all  the  specific  correspondences  and  "rules."  Yet,  it  is 
unnecessary,  perhaps  impossible,  for  learners  to  become  completely  aware  of  even 
one  level  (e.g.,  phonology)  of  their  linguistic  knowledge.  Here  again,  the  plausible 
maxim  "the  more  instruction  the  better"  does  not  seem  to  apply.  Instead,  linguistic 
awareness  instruction  can  direct  learners  attention  to  relevant  systematic  properties, 
so  that  they  are  able  to  effectively  apply  their  language  knowledge  to  learning  to 


read. 


Thinking  Aloud 

Studies  by  Berry  and  Broadbent  (1984,1987)  and  Berry  (1990)  have 
suggested  that  having  learners  talk  about  what  they  are  doing  while  they  are 
performing  a  learning  task  can  affect  implicit  learning  by  forcing  concentration  on 
the  way  in  which  they  are  carrying  out  the  task.  Verbalizing  about  the  task  while 
one  is  performing  it  may  make  learners  approach  the  task  differently,  perhaps  more 
systematically.  Admittedly,  this  may  have  limited  application  among  young 
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children.  Yet  perhaps,  "thinking  aloud"  by  having  readers  try  to  verbalize 
how/why  they  recognize  words  may  be  helpful  in  heightening  their  awareness  of 
relevant  linguistic  structures  in  a  script. 

The  Order  of  Implicit  and  Explicit  Learning 

The  order  of  implicit  or  explicit  learning  is  also  a  relevant  practical  concern. 
The  series  of  studies  done  by  Mathews  et  al.  ( 1989)  indicate  that  a  period  of  implicit 
observation  prior  to  instruction  is  very  beneficial  for  systems  that  engage  a  mixture 
of  implicit  and  explicit  learning.  For  systems  of  this  type,  developing  an  implicit 
learning  base  before  attempting  explicit  analysis  has  proved  to  be  beneficial  to 
learning.  Indeed,  this  is  widely  believed  to  be  the  case  with  reading.  The  notion 
that  exposure  to  print  in  early  childhood  "readies"  a  child  to  learn  to  read  has 
achieved  the  level  of  conventional  wisdom.  This  is  because  exposure  to  reading  in 
early  childhood  provides  children  with  a  period  of  observation  whereby  they  induce 
many  orthographic  properties,  and  they  are  able  to  construct  a  partial  knowledge 
base  before  schooling  begins37. 

In  fact,  educators  rely  on  the  assumption  that  most  children  will  bring  some 
orthographic  knowledge  to  the  beginning  reading  stage.  Overwhelmingly,  children 
that  have  been  underexposed  to  reading  stimuli  in  their  pre-school  years  start  out 
behind  in  reading  and  are  more  likely  to  have  initial  reading  difficulties  compared  to 
children  who  have  had  rich  print  experiences.  Furthermore,  underexposure  to  print 
in  the  pre-school  years  affects  subsequent  implicit  learning.  These  children  are 
likely  to  fall  further  behind  because  their  current  reading  experiences  are  unable  to 
build  on  previous  knowledge  of  print  and  language  that  better  readers  have  already 
acquired.  Because  of  these  difficulties,  the  total  amount  of  reading  experiences  will 
likely  be  fewer  (and  so  will  motivation  for  them).  As  previously  stated,  there  is 

37  Reading  experience,  of  course,  aids  in  the  development  of  other  reading-related  skills  as  well- 
developing  awareness  of  story  schemas,  for  example. 
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clear  evidence  that  reading  acquisition,  particularly  learning  an  orthography,  is 
gradual,  built  up  over  years  of  reading  experience.  Continuation  of  a  cycle  of 
underexposure  throughout  the  school  years  results  in  a  clear  distinction  between 
skilled  and  less-skilled  readers  in  terms  of  reading  experience  (Perfetti  1985). 

It  would  seem  that  targeting  these  children  early  on  would  be  a  key 
intervention  strategy.  They  are  in  need  of  instruction  that  does  not  assume  the  same 
pre-reading  knowledge  base  as  normally  progressing  students.  One  approach  to 
intervention  for  children  with  reading  difficulties  that  I  believe  is  suggested  by  this 
discussion,  is  to  simulate  the  prereading  environment  of  normally  progressing 
readers.  That  is,  in  conjunction  with  early  identification^,  these  children  would 
benefit  from  an  implicit  learning  period  (in  school)  where  they  are  read  to,  and  have 
their  attention  (visually)  focused  on  printed  language.  Further,  a  significant  degree 
of  familiarity  with  books  and  written  language  should  be  allowed  to  develop  before 
beginning  explicit  instructional  methods.  A  related  point  is  that  the  later  these 
children  are  identified,  the  less  likely  their  chances  are  of  catching  up  with  normal 
readers.  Waiting  until  they  are  tested  in  the  fourth  grade  and  then  targeting  them  for 
remediation  only  makes  instruction  that  much  more  difficult.  Since  what  they 
should  know  is  built  on  years  of  reading  practice,  short-term  remediation  is  unlikely 
to  be  effective. 

The  Role  of  Attention 

That  learning  can  occur  with  out  intention  to  learn,  does  not  mean  that 
implicit  learning  occurs  through  mere  exposure  to  the  input  Attention  to  input 
stimuli  is  critical  for  any  type  of  learning  to  occur.  Implicit  learning  is  not  a  wholly 
passive  "osmosis,"  but  a  byproduct  of  applying  attention  to  relevant  properties  of 


38  JhiSf|?eS,lion  appll/s  t0  ba<*ward  readers  without  biological  impairments  (dyslexia)  whose 
^dmgd.ff.culhesareo  adifferen, source.  Admittedly,  seating  these  two  JdingTsabied 
populahons  .s  already  a  large  task  for  the  school  systems,  albeit  an  essential  one. 
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the  system.  For  example,  in  artificial  grammar  learning,  subjects  are  attending  to 
committing  exemplar  strings  to  memory.  Knowledge  of  their  structure  is  a 
byproduct  of  attempting  to  memorize  many  exemplars.  With  reading  acquisition, 
learners  attend  to  various  things:  sound/symbol  relationships,  symbol(s)/meaning 
relationships  (words),  and  comprehending  the  message  of  sentences  and  the  larger 
text.   Attention  to  these  things  are  essential  for  implicit  learning  to  proceed. 

How  learners'  attention  is  best  focused  is  a  point  of  contention  among 
practitioners.  The  debate  turns  on  the  questions:  will  focusing  directly  on  the 
formal  properties  of  words,  or  focusing  on  the  comprehension  of  texts  best  point 
learners'  attention  to  relevant  features  in  a  script?  It  is  on  this  point  that  the  Code 
and  Whole  Language  teaching  philosophies  place  different  emphasis.  Interestingly, 
implicit  learning  research  supports  the  Code  philosophy  on  this  central  point: 
attention  to  some  aspect  of  form  is  requisite  for  implicit  learning  to  proceed.  For 
instance,  comprehending  the  meaning  of  a  sentence  is  not  possible  with  out 
attending  to  the  individual  words  that  compose  it.  Yet,  it  is  also  true  that  the  desire 
for  comprehension  is  ultimately  the  functional  motivation  for  attending  to 
orthographic  form.  Hence,  attention  to  both  form  and  comprehension  is  necessary, 
each  to  facilitate  the  other.  Emphasizing  learners'  attention  to  one  or  the  other- 
either  form  or  comprehension-seems  to  be  less  in  the  interest  of  learning  than 
understanding  the  synergistic  relationship  between  them. 

Indeed,  successful  readers  must  attend  to  both  form  and  comprehension, 
regardless  of  whether  they  are  in  code  or  whole-language  focused  classrooms. 
Interestingly,  research  that  has  attempted  to  evaluate  the  overall  effectiveness  of 
these  respective  teaching  approaches  suggests  that  the  differences  between  them  are 
undramatic,  although  instruction  that  includes  code  teaching  seems  to  be 
advantageous  for  some  things  (spelling  and  word  recognition  for  phonemically 
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regular  words)  and  not  harmful39  for  others  (Perfetti  1985,  Dewalt  1993).  This 
conclusion  underscores  the  idea  that  reading  acquisition  involves  a  mixture  of 
learning  strategies,  regardless  of  teaching  orientation.  It  should  not  be  assumed 
therefore,  that  learners  in  form-focused  classrooms  do  not  learn  implicitly,  or  do 
not  attend  to  comprehension.    Nor  should  it  be  assumed  that  learners  in 
comprehension-focused  classrooms  do  not  attend  to  form.    And,  it  should  be  noted 
that,  in  the  absence  of  the  recognition  of  the  implicit/explicit  dynamic,  such 
observations  remain  unexplained. 

A  Synthesis  of  Code  and  Whole  Language  Insights 

Both  the  code  and  whole-language  teaching  philosophies  reflect  important 
generalizations  about  reading  acquisition.  The  code  approach  asserts  that  learning 
to  read  hinges  on  the  conscious  attempt  to  decipher  and  internalize  the  orthographic 
system.  Whole-language  captures  the  idea  that  readers  induce  many  of  the 
principles  of  the  orthographic  system  naturally,  without  a  lot  of  explicit  instruction 
or  conscious  analysis.  These  different  perspectives  need  not  be  viewed  as  making 
opposing  assumptions  about  learning.  Alternatively,  I  contend  that  both  of  these 
generalizations  are  partly  true;  products  of  the  interface  between  implicit  and  explicit 
learning  processes. 

I  believe  the  framework  I  have  suggested  can  provide  a  bridge  between  the 

two  approaches  such  that  one  can  see  the  utility  in  integrating  the  some  of  the 

insights  of  both.  The  challenges  of  this  synthesis  are  the  following:  (1)  To  move 

toward  an  approach  that  recognizes  that  the  orthographic  code  is  multileveled, 

where  associations  are  many,  complex,  incomplete,  and  sometimes  conflicting. 

Hence  deciphering  the  code  will  involve  interpreting  many  types  of  cues 

simultaneously.  This  keeps  the  fundamentals  of  the  code  approach,  while  moving 

39  Whole  language  advocates  claim  thai  teaching  phoneme-grapheme  correspondences  and  spelling 
is  detrimental  to  the  main  task  of  comprehension. 


130 


away  from  the  dominant  perspective  that  readers  interpret  one  level  of  structure  at  a 
time.   (2)  To  recognize  that,  while  instruction  has  an  important  role  to  play,  much 
of  learning  to  read  proceeds  implicitly.  It  is  precisely  the  role  of  effective  reading 
instruction  to  promote  implicit  learning  by  focusing  learners  on  certain  salient 
formal  properties  of  the  system.  This  keeps  the  whole-language  insights  that 
readers  acquire  much  of  their  knowledge  on  their  own  and  that  it  is  the  teachers' 
role  to  facilitate  that  learning  process.   Finally,  (3)  to  recognize  that  instruction  that 
emphasizes  either  the  formal  properties  of  the  orthography,  or  comprehension  of 
the  global  message,  is  of  less  utility  than  a  teaching  philosophy  that  recognizes  and 
exploits  the  mutual  dependence  of  attention  to  both  form  and  comprehension. 

Theoretical  Implications 
This  study  has  drawn  on  the  literatures  in  reading,  the  linguistic  study  of 
writing  systems  and  implicit  learning  in  order  to  propose  a  framework  toward  a 
theory  of  reading  acquisition.  Usefully  integrating  these  literatures  not  only  permits 
a  reconceptualization  of  reading  acquisition,  but  also  has  theoretical  import  for  each 
individual  discipline  and  for  the  cognitive  sciences  in  general.  Accordingly,  each  of 
the  individual  conclusions  that  I  have  drawn,  and  their  integration  toward  a 
reconceptualization  of  reading,  are  theoretically  relevant.  In  this  section  however,  I 
will  discuss  relevant  theoretical  issues  that  I  have  not  addressed  at  length  in  the 
body  of  this  discussion. 

The  Optimalitv  of  Script  Type 

Treatments  of  the  history  of  writing  systems  have  often  speculated  about  the 
optimality  of  particular  scripts  in  terms  of  ease  of  acquisition  and  ease  of 
processing.  The  notion  that  the  alphabet  is  optimally  efficient  type  for  any  language 
and  is  the  culmination  of  an  evolutionary  process  (Gelb  1963)  still  has  considerable 


131 


currency*  despite  the  lack  of  linguistic  or  psychological  support  to  these  claims. 
There  is  no  conclusive  evidence  to  suggest  any  particular  script  type  is  learned  more 
easily  or  processed  more  efficiently.  Such  claims  are  based  on  fairly  superficial 
speculations  rather  than  studies  that  contrast  the  acquisition  and  processing  over 
various  script  types.  For  example,  these  treatments  usually  discuss  the  relative 
efficiency  of  a  particular  script  in  terms  of  the  number  of  signs  the  reader  must 
learn,  and/or  how  "elementary"  the  representation  of  spoken  language  is 
(Henderson  1982).  For  example,  segmental  scripts  represent  phonemes, 
presumably  the  most  basic  unit  of  speech.  Also,  a  script  is  generally  considered 
efficient  if  it  consistently  and  completely  represents  the  basic  elements  of  speech. 
This  conception  of  optimality  relies  on  the  assumptions  that  1)  fewer  cues 
are  more  easily  and  efficiently  processed  than  more  cues,  hence  alphabets'  inherent 
advantage  over  syllabaries  and  logographies,  2)  representing  the  phonemic  level  of 
speech  means  representing  the  most  elementary  level  of  spoken  language,  which  is 
a  priori  a  good  thing,  and  3)  consistently  representing  one  level  (optimally 
phonemic)  of  spoken  language  is  better  than  a  mixed  or  incomplete  representation. 
While  it  is  true  that  an  explicitly  obvious  script  with  a  minimal  number  of  systematic 
signs  would  present  certain  advantages  in  terms  of  processing,  e.g.,  explicit 
analysis,  it  would  also  present  disadvantages.    Contrary  to  the  notion  that  a  single 
leveled,  complete,  explicitly  analyzable  script  would  be  optimal,  the  fact  that  all 
scripts  are  multileveled  and  incomplete  suggests  that  there  is  some  processing 
advantage  for  scripts  to  be  structured  in  this  manner.  Moreover,  multileveled 
systems  represent  various  aspects  of  a  linguistic  system  that  is  already  part  of  the 
reader's  tacit  linguistic  knowledge.  It  follows  that  the  visual  system  would  profit 
from  making  connections  to  language  representations  already  in  place.  Also,  there 


There  is  a  fairly  extensive  literature  which  not  only  support  these  ideas  but  suggest  that 
learning  an  alphabetic  scnpt  gives  the  learner  a  general  intellectual  advantage  (McLuhan  1962' 
Olson  1977,1991;GoodyandWatt  1988;.HaveIock  1991). 
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is  no  reason  to  assume  that  an  entirely  consistent  script  (e.g.,  exclusively 
phonemic)  would  be  more  efficient  than  a  mixed  one,  since  what  is  gained  in 
regularity  is  lost  in  the  inability  to  make  multiple  connections  simultaneously. 

It  is  reasonable  to  conclude  that  it  is  because  of  human  processing 
constraints  that  scripts  have  the  formal  properties  that  they  do.  Neurobiological 
evidence  suggests  that  no  mechanism  in  the  human  brain  is  able  to  extract  all  the 
information  from  the  auditory  or  visual  stimulus.    Hence,  the  stimulus  is  broken 
up,  small  bits  of  the  information  are  processed  by  different  subsytems  which  extract 
different  kinds  of  information,  analyzing  particularly  different  properties,  in  order 
to  get  to  meaning  as  quickly  as  possible  (Galaburda  1996).  Linguistic  organization 
is  itself  a  reflection  of  this  type  of  processing.  Due  to  the  nature  of  linguistic 
complexity,  language  is  organized  into  several  structural  levels.  Further,  there  is 
considerable  evidence  that  various  levels  of  linguistic  structure  are  processed 
differently,  and  in  different  parts  of  the  brain.   Hence,  it  is  functional  for 
orthographies  to  represent  multiple  aspects  of  linguistic  structure  simultaneously 
both  because  such  an  organization  mirrors  the  way  languages  are  organized,  and 
because  it  is  consistent  with  the  way  people  extract  complex  information  in  general. 
Moreover,  the  interface  of  implicit  and  explicit  learning  systems  that  I  have 
suggested  in  regard  to  reading  acquisition  can  accommodate  the  facts  (as  we  know 
them  so  far)  of  multileveled  representation  and  processing. 

Since  the  primary  goal  of  an  orthographic  system  is  to  represent  spoken 
language  efficiently,  I  suggest  that  the  formal  properties  of  various  orthographies 
are  a  result  of  competing  subgoals  (presented  in  Fig.  7. 1 ),  each  of  which  attempt  to 
maximize  processing  efficiency  in  some  respect,  either  the  implicit  or  explicit 
domain.   Consequently,  certain  of  these  competing  goals  will  facilitate  implicit 
analysis  and  others  explicit  analysis.  For  example,  a  particular  script  may  have 
certain  systematic  features  that  are  very  obvious  (e.g.,  one  graphic  unit  =  one 
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linguistic  unit),  resulting  in  efficient  explicit  analysis  of  this  aspect  of  the 

orthography.  That  same  script  will  have  less  obvious  properties  as  well,  where 

implicit  analysis  is  effective. 

Fig.  7.1 

Goal:  Represent  spoken  language  efficiently 


Explicit 


I 


Competing  Subgoals 


I 


1)  Units  of  representation  should  be  easily  perceptible: 

•  perceptually  salient  (e.g.,  syllables  are  perceptually  salient  units). 

•  distinctive/obvious  (e.g.,  1  unit=  1  unit). 

•  as  few  in  number  as  possible  to  comprise  a  consistent  system. 


Implicit 


2)  The  system  should  include  many  types  (e.g.,  levels)  of 

information.  This  is  because 

•  language  includes  many  types  of  information. 

•  processing  of  complex  stimuli  (in  general)  occurs  by  analyzing  the 
many  different  types  of  information  simultaneously,  by  different 

subsystems  and  along  different  pathways. 

3)  There  is  default  toward  economy  (low  cost  behavior). 
-  There  is  a  tendency  to  use  a  minimal  amount  of  information  to 

achieve  the  goal  of  efficient  representation.  Since  writing  is  a 
representation  of  preexisting  knowledge,  it  is  unnecessary  to 
include  all  of  the  information  in  the  spoken  language. 


I 


Orthographic  Form 


The  variation  in  orthographic  form41  is  a  result  of  maximizing  certain  subgoals  over 
others.  Hence,  while  various  scripts  will  have  both  explicitly  and  implicitly 
analyzable  properties,  individual  scripts  will  vary  as  to  the  relative  amounts  of  these 
different  properties.  An  efficient,  working  orthography  will  include  elements  from 


41  In  addition  to  the  obvious  fact  that  orthographic  variation  is  a  result  of  representation  of 
different  language  types. 
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each  of  the  subgoals  above,  and  hence  facilitate  a  blending  of  implicit  and  explicit 
analysis. 


Implicit/  Explicit  Learning  and  Individual  Differences  and  IP 

It  has  been  hypothesized  that,  based  on  the  idea  that  evolutionarily  older 
functions  will  be  displayed  in  common  across  a  species,  implicit  cognitive  functions 
should  show  less  individual-to-individual  variability  than  explicit  functions  (Reber 
1989).  Implicit  learning  studies  have  explored  this  premise  and  suggested  that  there 
were  substantial  individual-to-individual  variability  among  subjects  performing  an 
explicit  learning  task  while  there  were  relatively  small  differences  among  subjects 
with  an  implicit  learning  task  (Reber,  Walkenfeld,  and  Hernstadt  1991) .   Further, 
the  authors  found  that  performance  on  the  explicit  task  strongly  correlated  with  IQ 
(taken  as  a  measurement  of  explicit-type  learning),  but  performance  on  the  implicit 
task  did  not. 

I  have  claimed  that  learning  to  decode  an  orthography  relies  to  a  large  degree 
on  implicit  processes.  The  view  above  suggests  that  there  should  be  lower 
individual  variability  in  regard  to  aspects  of  reading  acquisition  that  are  largely 
implicit,  than  those  that  require  explicit  learning  and  knowledge.  While  I  have 
stated  that  decoding  is  only  partly  a  result  of  implicit  processes  and  relies  on  an 
interface  between  implicit  and  explicit  learning,  it  does,  to  a  large  degree,  proceed 
implicitly.  Following  this  hypothesis,  if  learning  to  decoding  is  largely  implicit, 
then  it  should  be  fairly  stable  across  the  population 

Yet,  the  distinction  between  better  and  poorer  readers  in  terms  of  decoding 
ability  has  been  documented  many  times.  In  fact,  poorer  readers42  are  those  who 
perform  more  poorly  than  skilled  readers  on  a  variety  of  tasks:  orthographic 


42  Poorer  readers  are  distinguished  from  dyslexic  readers  whose  characteristics  are  quite  different 
typically  higher  IQ,  speech  and  language  abnonnalities,  lower  incidence  of  disadvantaged  homes. 
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knowledge,  phonemic  awareness,  word  naming,  listening  vocabulary,  use  of 
working  memory,  shorter  memory  spans,  and  so  on  (Perfetti  1985,  Stanovich, 
Nathan,  and  Vala-Rossi  1986).  Poorer  readers  are  also  generally  characterized  by 
low  IQ  scores,  general  developmental  delays  and  have  a  high  incidence  of 
disadvantaged  homes  (  Raynerand  Polletsek  1989).  It  should  also  be  noted  that 
poorer  readers  can  improve  their  language  and  reading  skills  with  the  appropriate 
help  (Bradley  and  Bryant  1983;  Mann  1984)  and  become  normal  readers.  While  it 
may  be  easy  to  assume  that  this  reading  difference  is  due  to  difference  in 
intelligence,  there  are  a  fair  number  of  poorer  readers  (not  dyslexic)  whose  IQ 
scores  suggest  that  they  should  be  reading  better  than  they  are.  I  believe  that  a 
major  factor  that  accounts  for  why  poorer  readers  are  deficient  in  both  explicit, 
intelligence-related  tasks,  and  more  implicit  decoding-related  tasks  is  that  poorer 
readers  of  all  IQ  levels  are  likely  to  be  stimulus-deprived.    That  is,  what  they  have 
in  common  is  that  they  are  generally  from  disadvantaged  homes  where  their 
experience  with  print  and  language  is  minimal,  an  inadequate  stimulus  environment 
for  implicit  (and  other)  learning  to  proceed  effectively.  Hence,  they  are 
underexposed  to  print  stimuli  and  lack  awareness  of  its  relationships  to  spoken 
language. 

Another  point  is  that  learning  to  decode  is  necessary  for  normal  reading  but 
does  not  necessarily  make  one  a  better  reader.  Hence,  better  readers  (measured  by 
comprehension  ability)  typically  have  good  decoding  skills  (Hoover  and  Tumner 
1993).    However,  good  decoders  do  not  necessarily  have  good  comprehension 
skills  and  may  not  necessarily  become  better  readers  (Perfetti  1985).  Furthermore, 
skills  associated  with  decoding,  such  as  phonemic  awareness,  phonemic 
segmentation  ability,  and  knowledge  of  orthographic  regularities  do  not  correlate 
with  IQ.  This  is  particularly  evident  in  the  early  grades.  However,  reading  ability 
in  general  (comprehension  and  so,  the  factors  that  influence  it)  in  the  later  school 
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grades  does  correlate  with  intelligence  (Byrne  and  Helding-Bamsley  1993; 
Foorman,  Francis,  Noby  and  Liberman  1991).  This  is  interesting  in  that  it  shows  a 
difference  in  the  types  of  learning  and  knowledge  necessary  for  reading,  yet  also, 
the  dependence  of  more-explicit  functions  on  implicitly  acquired  knowledge. 

On  Connectionism 

Processing  multileveled,  distributed  knowledge,  where  large  amounts  of 
sometimes  conflicting  information  is  brought  together  is  consistent  with  the  ideas  of 
parallel  distributed  processing/  connectionist  networks.  The  principle  of  distributed 
knowledge  is  central  to  theories  of  implicit  learning.  Connectionist  systems  have 
been  used  to  model  cognitive  phenomena  that  are  not  explicit  m  nature.  Explicit 
cognitive  operations  such  as  reasoning,  which  constructs  answers  from  stored 
knowledge  by  a  high  level,  goal  directed  process,  are  modeled  by  symbolic 
systems.  In  symbolic  systems,  knowledge  is  encoded  in  terms  of  explicit  symbolic 
structures,  and  inferences  are  based  on  rules  that  operate  on  these  structures.  In 
contrast,  implicit  processing  occurs  through  immediate  associations,  in  parallel, 
and  without  conscious  control  (Miikkulainen  1994).   As  a  result,  connectionist 
models  are  very  good  at  simulating  those  aspects  of  learning  -  implicit  learning  ~ 
that  symbolic  systems  are  not  equipped  to  handle. 

The  basic  strategy  of  the  connectionist  approach  is  to  take  as  its  fundamental 
processing  unit  something  close  to  an  abstract  neuron.  Neural  processes  are  slow 
(compared  to  a  computer)  but  the  human  brain  has  very  many  of  them.  Rather  than 
organize  computation  with  many  serial  steps,  as  with  systems  that  are  very  fast,  the 
brain  must  deploy  many  processing  elements  cooperatively  and  in  parallel  to  carry 
out  its  activities.  Computation  is  carried  out  through  simple  interconnection  among 
such  processing  units.  The  resultant  knowledge  is  in  the  connections,  or  the  rules 
for  forming  them  through  experience,  not  stored  in  units  in  the  system.    Hence,  in 
parallel  processing  networks,  representations  exist  within  several  levels  of 
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structure.  Such  systems  extract  cues  from  various  levels  at  once  and  then  select  the 
simplest  most  robust  network  consistent  with  observations  being  made.  These 
systems  are  plastic  in  the  sense  that  the  pattern  of  interconnection  is  not  fixed  for  all 
time;  rather,  strengths  of  the  connections  are  modified  by  experience.  In  this  way 
the  system  can  evolve  (Rumelhart  1993). 

The  multi-dimensional,  multi-faceted  composition  of  orthographies  and  their 
gradual  acquisition  are  in  accord  with  such  a  processing  network.  Connectionist 
interactive  activation  theories  of  reading  (Seidenberg  1985,  1987,  Seidenberg  and 
McClelland  1989)  have  significantly  advanced  dual-route  theories  by  emphasizing 
that  semantic  and  phonological  representations  are  interactive.  Furthermore,  such 
theories  posit  that  associative  maps  will  be  formed  for  all  systematic  recurring 
structures.  Hence,  theoretically,  they  can  accommodate  multileveled  processing  in 
general.  That  is,  all  linguistic  structures  (phonemes,  syllables,  morphemes)  present 
in  the  visual  stimulus  could  be  associated  via  the  same  process. 

Parallel  distributed  processing  (PDP)  networks  have  modeled  aspects  of 
human  language  learning  that  are  usually  considered  to  be  rule-governed  without 
specifying  any  rules  in  the  system.  Rumelhart  and  McClelland's  (1986)  model  of 
acquiring  English  past  tense  and  Lee  and  Gasser's  (1992)  model  of  learning 
English  plural  are  examples  of  successful  acquisition  without  recourse  to 
preexisting  rules  of  the  grammar.   Rumelhart  and  McClelland  (1986)  argue  that  the 
mechanisms  that  process  language  are  constructed  in  such  a  way  that  their 
performance  is  characterizable  by  rules,  yet  the  rules  themselves  are  not  written  in 
explicit  form  anywhere  in  the  mechanism.  This  model's  ability  to  process  some 
novel  inputs,  suggest  that,  to  some  degree,  the  system  was  able  to  abstract 
structures  and  develop  a  layout  of  "rules"  (my  word)  so  that  further  input  could  be 
interpreted. 
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Yet,  connectionism  has  limitations  for  modeling  human  learning,  which 
involves  a  mixture  of  implicit  and  explicit  processes.  It  has  been  argued,  for 
example,  that  PDP  networks  learn  to  slow  to  reflect  human  learning,  they  do  not 
account  for  the  critical  role  of  attention  plays  in  implicit  learning,  and  they  do  not 
account  for  rapid  acquisition  of  knowledge  through  explicit  rule-following  or 
"clicks"  of  comprehension  (Schmidt  1994).    Also,  connectionist  models  have 
difficulty  capturing  the  human  ability  to  abstract  the  structure  of  the  training 
examples  such  that  they  can  process  novel  inputs43.  These  limitations  are  directly 
related  to  the  fact  that  human  learning  is  both  implicit  and  explicit.  Because  of  this, 
there  have  been  proposals  for  "hybrid"  systems  that  incorporate  both  symbolic  and 
connectionist  processing.  Smolensky  (1988)  suggests  that  hybrid  systems  that 
integrate  a  rule  interpreter  and  an  intuitive  processor  are  necessary  to  model  the 
interface  between  implicit  and  explicit  learning  and  knowledge. 

Rule  Based  vs.  Association/Analogy  Based  Models 

Connectionist  models  are  based  on  the  idea  that  learning  is  fundamentally 
distributed  association-based  learning.   Similarly,  most  implicit  learning 
researchers  agree  that  encoding  small  fragments  of  knowledge  and  associating  them 
in  various  ways  is  central  to  the  learning  task.  With  reading  also,  there  is  a  good 
deal  of  empirical  evidence  which  suggests  representations  are  built  up  "by  analogy" 
(Glushko  1981;  Goswami  1988)  to  representations  already  present  in  memory. 
Seidenberg  (1987)  also  proposes  that  orthographic  knowledge  is  built  up  from 
extracting  fragments  of  information,  without  recourse  to  explicit  knowledge  of 
rules.  Further,  the  human  ability  to  transfer  knowledge  of  a  system's  structures  - 
whether  it  be  an  artificial  grammar  or  an  orthography  -  to  novel  inputs  means  they 


The  McCIeUand  and  Rumelhart  (1986)  model  was  able  to  process  some  novel  inputs  though 
imperfectly.  This  is  a  fundamental  limitation  that  connectionist  models  are  trying  to  address  and 
resolve.  '    e 
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have  developed  an  abstract  knowledge  base.   The  challenge  for  analogy  based 
models  in  implicit  learning  and  reading,  like  connectionist  models  in  general,  is  that 
the  learning  system  must  be  able  to  abstract  a  layout  of  structures  and  rules  such 
that  novel  inputs  can  be  processed. 

Implications  for  Linguistics 

What  does  reading  have  to  do  with  language?   Modern  linguistics  has 
traditionally  focused  on  spoken  language  as  its  primary  object  of  inquiry.  The 
rationale  for  the  exclusion  of  written  language  as  an  object  of  linguistic  interest  is 
basically  that  written  language  is  only  a  representation  (an  incomplete  copy)  of 
natural  language,  not  the  reverse.  Consequently,  it  is  presumed  that  the  facts  of 
written  language  systems  do  not  bear  on  spoken  language  and  will  not  inform  the 
basic  questions  of  linguistic  inquiry.  Yet,  the  fact  that  written  language  is  a 
secondary  code  does  not  mean  it  is  irrelevant  to  linguistic  inquiry.  I  am  arguing  that 
since  orthographies  represent  language,  they  are  themselves  evidence  of  linguistic 
structures  and  of  what  people  know  about  their  language.  Furthermore,  they  are 
evidence  of  constraints  on  language  processing.  I  have  suggested  that  scripts  have 
adapted  as  they  have  because  of  the  way  people  process  language.  Hence,  form  is 
a  result  of  function. 

Language  acquisition  and  implicit  learning.    Language  acquisition  is 
typically  evaluated  in  two  basic  ways:  1)  the  explication  of  the  content  of  the 
material  acquired,  the  depiction  of  what  is  learned  and  2)  the  characterization  of 
the  learning  processes  itself,  the  theory  of  ho  w  it  is  acquired  (Reber  1973). 
Linguistics  has  conventionally  been  more  interested  in  issues  of  content  rather  than 
issues  of  process.  As  Chomsky  (1959:  55)  established  it,  "It  is  futile  to  inquire  into 
the  causation  of  verbal  behavior  until  much  more  is  known  about  the  specific 
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character  of  this  behavior;  and  there  little  point  in  speculation  about  the  process  of 
acquisition  without  much  better  understanding  of  what  is  acquired"  [emphasis 
mine].  Consequently,  contemporary  theory(s)  of  language  acquisition  have 
emphasized  the  complexity  of  the  final  knowledge  base  rather  than  how  the 
knowledge  was  acquired. 

Accordingly,  Principles  and  Parameters  theory  of  language  has  treated  (1 
believe  sidestepped)  the  acquisition  process  issue  by  positing  a  content-specific 
Language  Acquisition  Device  (LAD)  which  is  constrained  by  Universal  Grammar 
(UG)--  an  innate  language  faculty  unique  to  the  human  species.    In  this  view,  since 
the  linguistic  knowledge  base  is  too  complex  to  be  learned  from  input  alone,  the 
human  mind  comes  innately  equipped  with  structures  which  define  the  linguistic 
content  (principles)  of  all  languages.  Cross-linguistic  variation  is  a  result  of  these 
principles  instantiating  themselves  in  slightly  different  ways  (parameters).  Since 
language  learning  is  seen  as  more  than  inducing  rules  from  input,  speakers  are 
assumed  to  know  much  more  than  they  have  learned.  Furthermore,  constrained  by 
innate  learning  mechanisms  which  are  preprogrammed  with  the  principles  of  UG, 
language  acquisition  proceeds  implicitly  and  results  in  an  implicit  knowledge  base. 
These  innate  learning  mechanisms  are  language-specific  and  apply  to  no  other 
behavior. 

Positing  a  language-only,  implicit  acquisition  device  has  provided  a  useful 

framework  for  a  discipline  that  is  predominantly  interested  in  exploring  (and 

emphasizing  the  complexity  of)  the  linguistic  knowledge  base.  Still,  if  one  chooses 

to  adopt  a  perspective  that  is  more  interested  in  how  knowledge  is  acquired,  it  is  of 

little  use.  For  example,  positing  a  LAD  and  UG  crucially  relies  on  the  concepts  of 

implicit  learning  and  implicit  knowledge.  Yet,  they  explain  little  about  implicit 

learning  or  learning  in  general.  Since  in  the  UG  view*",  the  content  of  learning  is 

44  I  am  presenting  a  strong  Chomskian  view  of  Universal  Grammar  theory  which  concerns  itself 
with  grammatical  competence  -  a  speakers  grammatical  knowledge.  In  this  perspective  language 
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largely  prespecified,  there  is  no  need  to  develop  a  theory  of  learning  that  can 
account  for  language  acquisition.   For  instance,  the  implicit  learning  mechanisms 
conceptualized  in  the  UG  framework  are  very  simple  trigger-like  mechanisms, 
which  induce  enough  input  information  to  invoke  linguistic  principles  and  set 
individual  language  parameters. 

Yet,  there  is  general  agreement  among  implicit  learning  researchers  (Dienes 
and  Berry  1993;  Cleeremans  1993;  Winter  and  Reber  1994)  and  some  in  the  field  of 
second  language  acquisition  (Schmidt  1994),  that  implicit  learning  seems  to  be 
based  on  some  general  mechanisms  of  human  learning.    While  it  is  possible  that 
mutually  exclusive  implicit  learning  mechanisms  have  evolved  (one  for  language 
and  one  domain  general),  there  seems  little  adaptive  utility  in  it.  Indeed,  one  of  the 
fundamentals  of  Darwinian  evolution45  is  that  traits  that  are  adaptationally 
successful  to  a  species  continue  and  become  foundations  for  new  developments. 
Older  traits  that  are  preserved  continue  to  be  used  to  purposes  other  than  those  for 
which  they  were  originally  adapted.  Language  is  an  example  of  an  adaptationally 
successful  process  that  was  built  on  brain  and  body  structures  that  were/are  used 
for  other  functions.  Organs  essential  for  speech  --  nasal  system,  tongue,  larynx, 
lungs,  and  so  on  --were  adapted  for  other  purposes  before  they  became  used  for 
speech.  Similarly,  parts  of  the  brain  associated  with  language  are  also  associated 
with  other  functions,  e.g.,  speech  production  and  motor  control  for  ingestive 
processes  (MacNeilage  1988).  Therefore,  from  a  functionalist  view,  it  is  likely  that 
the  implicit  learning  involved  with  language  acquisition  relies  on,  or  was  built  on, 
general  implicit  learning  mechanisms.  Furthermore,  from  this  perspective,  content 


acquisition  means.ideal  learning  unaffected  by  the  developmental  concerns  of  processing  and 
memory  constraints,  maturation  or  motivation.  There  are,  of  course,  competing  perspectives 
within  linguistics,  and  I  do  not  suggest  that  all  who  endorse  the  concept  of  Universal  Grammar  are 
unconcerned  with  acquisition  and  process  issues 

Evolution  through  inherited  random  variation  and  natural  selection.  Reber  (1989  1993)  posits 
an  account  of  domain  general  implicit  learning  from  a  functionalist,  evolutionary  perspectivr 
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specific  learning  mechanisms  like  UG's  LAD  require  evolutionary  biological  events 
that  are  highly  unlikely  (Deacon  1991). 

The  existence  of  a  truly  domain  general  (language  inclusive)  learning 
mechanisms  would  mean  that  language  and  other  cognitive  faculties,  at  least  to 
some  degree,  rely  on  the  same  machinery.  Arguments  for  a  language-specific 
processor  and  a  modular  language  faculty  (Chomsky  1980,  1986;  Pinker  1987, 
1993)  have  been  advanced  on  the  basis  of  "poverty  of  the  stimulus"  arguments-that 
learning  (as  we  now  understand  it)  cannot  account  for  such  a  sophisticated 
knowledge  base,  and  consequently,  humans  must  be  born  with  the  principles  of 
language  structure.  Yet,  because  no  existent  theory  of  learning  can  adequately 
account  for  linguistic  knowledge  does  not  mean  one  cannot  be  developed.  Here 
again,  I  believe  research  in  implicit  learning  has  a  contribution  to  make.  Instead  of 
language  acquisition  being  constrained  by  innately  given  language  structures,  it  is 
possible  that  what  are  biologically  determined  are  cognitive  mechanisms  which  are 
able  to  implicitly  extract  complexly  structured  information46.  That  is  to  suggest  that 
it  is  not  the  principles  of  linguistic  structure  that  are  innate,  but  the  ability  to  learn 
them  implicitly. 


Winter  and  Reber  (1994)  extend  the  hypothesis  of  domain  general  implicit  learning  to  include 
language  learning.  They  suggest  that  there  are  general  cognitive  mechanisms  which  allow  the 
child  to  extract  the  underlying  regularities  from  the  surface  and  construct  an  implicit  model  that 
mirrors  base  structure.  The  assumption  here  is  that  there  exists  a  biological  predisposition  for 
implicit  learning  and  that  learning  language  is  similar  to  learning  other  types  of  complex 
information  in  that  they  are  all  a  result  of  such  innate  general  learning  mechanisms  Accordingly 
the  linguistic  system  that  the  child  acquires  is  a  result  of  a  general  cognitive  ability  to  abstract  out 
the  regularities  in  the  communicative  environment  (linguistic,  pragmatic,  social),  and  so  generate 
novel  utterances  and  understand  similnrlv  n™  ,.i  ;„„„*<■ 


novel  utterances  and  understand  similarly  novel  inputs. 
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